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Abstract 

This  paper  develops  a  model  of  evolving  standards  for  academic  publishing.  It  is  mo- 
tivated by  the  increasing  tendency  of  academic  journals  to  require  multiple  revisions  of 
articles  and  by  changes  in  the  content  of  articles.  Papers  are  modeled  as  varying  along  two 
quality  dimensions:  q  and  r.  The  former  represents  the  clarity  and  importance  of  a  paper's 
main  ideas  and  the  latter  its  craftsmanship  and  polish.  Observed  trends  are  regarded  as 
increases  in  r-quality  A  static  equilibrium  model  in  which  an  arbitrary  social  norm  de- 
termines how  q  and  r  are  weighted  is  developed  and  used  to  discuss  comparative  statics 
explanations  for  increases  in  r.  The  paper  then  analyzes  a  learning  model  in  which  referees 
(who  have  a  biased  view  of  the  importance  of  their  own  work)  try  to  learn  the  social  norm 
from  observing  how  their  own  papers  arc  treated  and  the  decisions  editors  make  on  papers 
they  referee.  The  model  predicts  that  social  norms  will  gradually  but  steadily  evolve  to 
increasingly  emphasize  r-quality. 
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1      Introduction 

I  encourage  potential  readers  of  this  paper  to  first  put  it  down  for  two  minutes  and  thumb 
through  a  thirty-  or  forty-year-old  issue  of  an  economics  journal.  I  anticipate  that  this  will 
convey  much  better  than  I  can  how  dramatically  economics  papers  have  changed  over  the 
last  few  decades.  Papers  today  are  much  longer.1  They  have  longer  introductions.2  They 
have  more  sections  discussing  extensions  of  the  main  results.  They  have  more  references. 
The  process  by  which  papers  get  published  has  also  changed  dramatically.  Around  1960 
the  Quarterly  Journal  of  Economics  reserved  the  "revise-and-resubmit"  option  for  five  or 
so  special  cases  per  year.  In  the  early  1970's  most  journals'  processes  still  let  authors 
submit  their  papers,  receive  reports,  make  revisions  and  get  a  final  acceptance  within 
about  nine  months.  Today  extensive  revisions  arc  the  norm  and  getting  an  acceptance 
takes  20  to  30  months  at  most  top  journals.4  Many  other  academic  disciplines  have  had 
similar  experiences.  While  the  changes  in  academic  journals  arc  widespread  and  dramatic, 
they  have  received  little  attention.  Perhaps  this  reflects  that  the  changes  have  occurred 
gradually  and  that  researchers  in  each  field  see  only  a  small  part  of  the  overall  picture. 

In  this  paper  I  develop  a  model  to  organize  the  observed  trends  and  help  us  think  about 
potential  explanations.  My  central  premise  is  that  we  can  usefully  regard  academic  papers 
as  differing  along  two  quality  dimensions:  q  and  r.  I  think  of  q  as  reflecting  a  paper's  main 
ideas  and  r  as  representing  other  aspects  of  quality  described  by  words  like  craftsmanship, 
thoroughness  and  polish.  I  think  of  the  various  changes  noted  above  as  different  aspects 
of  a  single  phenomenon  —  an  increase  in  the  r-quality  of  published  papers.  I  first  develop 
an  equilibrium  model  of  time  allocation  and  quality  standards  and  use  it  to  discuss  what 


'Ariel  Rubinstein's  1982  Econometrica article,  "Perfect  Equilibrium  in  a  Bargaining  Model,"  for  example, 
is  shorter  than  the  average  note  in  Econometrica  in  2000  (if  one  accounts  for  the  type  sizes). 

For  example,  Amartya  Sen's  "The  Impossibility  of  a  Paretian  Liberal"  and  two  other  papers  in  the 
January- February  1970  issue  of  the  Journal  of  Political  Economy  had  one  paragraph  introductions.  No 
introduction  in  the  Feburary  2000  JPE  was  shorter  than  seven  paragraphs  and  two  were  longer  than  Sen's 
entire  paper. 

An  extreme  example  here  is  Michael  Spence's  1973  QJE  paper  "Job  Market  Signaling."  It  contains  two 
references.  While  my  anectodes  all  relate  to  classic  papers,  old  journals  also  contain  many  derivative  papers 

making  N    '  oider  contributions  to  long  forgotten  literatures.   These  also  look  very  different  from  today's 
papers. 

''The  one  exception  is  the  QJE  where  it  takes  about  13  months.  See  Ellison  (2000)  for  much  more  data 
on  the  duration  of  review  process. 


might  cause  such  an  increase.  I  then  develop  a  dynamic  learning  model  in  which  economists 
struggle  to  reconcile  the  high  standards  to  which  they  arc  being  held  with  the  mediocrity 
they  see  in  journals  (This  occurs  in  part  because  economists  have  biased  views  of  the  quality 
of  their  own  work.)  The  model  predicts  that  social  norms  will  evolve  gradually  to  emphasize 
r-quality. 

I  begin  in  Section  2  with  a  quick  look  at  some  data  on  the  form  of  papers  and  the  length 
of  the  review  and  revision  process  in  a  number  of  disciplines. 

Section  3  describes  a  static  equilibrium  model  of  the  journal  publishing  process.  A 
continuum  of  economists  are  endowed  with  one  unit  of  time  in  which  to  write  one  paper  for 
the  one  journal  in  the  profession.  Papers  are  described  by  their  position  along  two  quality 
dimensions,  q  and  r.  I  interpret  the  q  dimension  as  reflecting  the  clarity  and  importance 
of  the  main  results  of  the  paper,  and  the  r  dimension  as  reflecting  various  other  aspects 
of  quality  (generality,  robustness  checks,  extensions,  etc.)  that  are  typically  improved 
in  revisions  and  when  authors  "polish"  papers  prior  to  submission.  The  main  decision 
economists  make  is  how  to  divide  their  time  between  the  developing  new  ideas  and  working 
on  r-quality.  A  commonly  understood  social  norm  determines  how  the  quality  dimensions 
arc  weighted. 

I  analyze  the  static  model  in  Section  4.  For  a  range  of  parameter  values  the  model 
produces  a  reasonable  outcome.  Papers  initially  submitted  to  the  journal  can  be  divided 
into  three  groups.  Papers  with  the  lowest  q  are  determined  to  be  of  sufficiently  low  interest 
that  no  feasible  revision  could  make  them  acceptable.  They  are  not  revised.  A  second  set  of 
papers  with  intermediate  q's  are  marginal.  These  papers  are  revised  to  the  greatest  extent 
possible,  but  only  some  become  acceptable.  Finally,  papers  with  a  sufficiently  high  q  will 
be  publishable  even  if  they  achieve  a  somewhat  lower  r.  The  authors  of  these  papers  are 
always  able  to  meet  the  revision  standard  that  is  set  for  them. 

I  make  two  main  observations  about  the  static  model.  First,  a  continuum  of  social  norms 
are  possible.  If  the  community  agrees  that  ^-quality  is  very  important,  then  authors  will 
spend  most  of  their  time  developing  main  ideas.  If  r-quality  is  very  important,  then  authors 
spend  very  little  time  on  ideas  and  focus  on  revisions.  Nothing  in  the  model  prevents  either 
extreme  or  something  in  the  middle  from  being  an  equilibrium.  Second,  the  marginal  papers 


that  fall  just  short  of  being  accepted  arc  relatively  low  in  (-/-quality  and  relatively  high  in 
/•-quality  relative  to  the  set  of  resubmitted  papers.  This  is  a  consequence  of  the  assumption 
that  a  paper's  q-quality  is  developed  first  and  revisions  later  increase  r. 

The  model  makes  clear  that  a  number  of  equilibrium  comparative  statics  explanations 
could  be  given  for  increases  in  r-quality.  Such  explanations,  however,  may  not  work  well 
in  practice.  The  observed  changes  in  economics  and  other  fields  are  large.  Accounting  for 
them  requires  cither  a  large  change  in  some  exogenous  variable  or  a  high  sensitivity  of  the 
outcome  to  an  exogenous  variable.  My  impression  from  the  empirical  work  I've  done  is  that 
the  economics  profession  has  not  changed  much  over  the  last  thirty  years.  (Ellison,  2000) 
Equilibrium  explanations  arc  also  hard  to  reconcile  with  editors'  comments  that  changes 
were  not  planned  or  desired.  I  take  these  difficulties  as  motivation  for  exploring  evolutionary 
explanations  —  to  account  for  observed  changes  such  an  explanation  only  need  sto  provide 
a  good  reason  to  think  that  each  year  papers  will  be  a  paragraph  longer  and  require  a 
couple  weeks  more  to  revise  than  the  year  before. 

Section  5  describes  a  dynamic  learning  model.  The  main  actors  in  the  model  arc  the 
journal's  referees  (who  arc  also  the  authors).  The  referees  are  not  self-interested.  They 
simply  try  to  apply  the  prevailing  standards  of  the  profession  in  weighting  7-quality  and 
r-quality  and  in  proposing  a  set  of  improvements  that  would  bring  a  paper  up  to  the  pub- 
lication threshold.  Referees  learn  the  prevailing  standards  from  two  sources:  observations 
of  what  revisions  they  arc  asked  to  make  and  observations  of  whether  editors  eventually 
decide  to  accept  or  reject  papers  they  have  refereed.  Editors  are  not  an  active  force  in  the 
model.  They  simply  fill  the  journal's  slots  by  accepting  the  fraction  r  of  papers  with  the 
highest  average  quality  (weighted  according  to  the  then  prevailing  norm).T  view  this  as  a 
model  of  a  busy  editor.  Making  referees  the  driving  force  is  also  consistent  with  editors' 
comments  that  they  abhor  the  trend  toward  fifty  page  papers  with  myriad  extensions  and 
wish  that  authors  would  just  concisely  explain  what  ideas  they  have. 

Section  6  discusses  the  dynamic  model  with  no  overconfidence  bias.  All  of  the  equilibria 
(if  the  static  model  are  steady-states.  I  note  that  if  referees  try  to  to  hold  authors  to  a 
standard  that  is  infeasibly  high,  then  economists  will  both  realize  that  overall  quality  stan- 
dards arc  lower  than  they  had  thought  and  conclude  that  r-quality  must  be  relatively  more 


important  than  they  had  thought.  The  latter  inference  is  made  because  the  submarginal 
papers  that  are  unexpectedly  accepted  are  relatively  low  in  q  and  relatively  high  in  r. 

Section  7  adds  in  the  assumption  that  economists  are  biased  and  think  their  work  is 
slightly  better  than  it  really  is.5  The  former  steady-states  of  the  model  are  destabilized 
as  referees  try  to  hold  others  to  the  higher  standard  that  they  perceive  is  being  applied 
to  their  own  papers.  Standards  can  not  stray  far  from  the  equilibrium  set  -  -  referees 
would  see  editors  accepting  many  papers  they  strongly  recommended  against  and  this 
would  overwhelm  the  e  bias.  What  happens  instead  is  that  referees  perpetually  try  to  hold 
authors  to  a  standard  that  is  slighly  too  high.  The  observation  of  the  previous  section 
about  how  economists  rationalize  standards  that  are  too  high  still  applies.  The  result  is  a 
gradual  evolution  of  social  norms  to  increasingly  weight  r-quality.  The  slow  steady  form  of 
the  dynamics  makes  them  a  candidate  for  explaining  observed  trends.  In  the  long  run  the 
process  stops  short  of  papers  having  no  q-quality  at  all,  because  the  dynamics  change  once 
no  paper's  idea  is  good  enough  to  make  its  acceptance  a  sure  thing. 

There  has  been  little  related  theoretical  work  on  the  dynamics  of  standards.  The  most 
notable  is  Sobel's  (1998)  analysis  of  a  model  in  which  a  series  of  candidates  work  to  obtain 
admission  to  a  club.  (One  could  think  of  publishing  in  a  journal  as  an  example.)  Judges 
have  heterogeneous  tastes.  The  focus  is  on  how  different  voting  rules  lead  to  rising,  falling 
or  fluctuating  standards. 

The  fact  that  I  have  not  to  followed  the  current  trend  and  given  this  paper  an  overly 
general  title  and  a  seven  page  introduction  should  not  be  taken  to  indicate  that  there  are  not 
broader  lessons  to  be  learned  from  it.  There  are  all  kinds  of  social  norms,  e.g.  standards 
for  politeness,  standards  for  language  and  violence  on  television,  hazing  at  fraternities, 
hours  worked  by  young  doctors  and  lawyers,  years  spent  in  higher  education,  distributions 
of  grades,  etc.   Many  of  these  norms  have  commonly  perceived  trends,  but  other  than  the 


In  the  psychology  literature  this  is  referred  to  as  an  overconfidence  bias.  An  often  discussed  example  is 
Svenson's  (1981)  finding  that  about  90  percent  of  the  U.S.  college  students  in  his  study  estimated  themselves 
to  be  safer  (and  more  skillful)  at  driving  than  the  median  subject.  Psychologists  have  reported  that  experts 
in  many  fields  are  overconfident  in  assessing  their  own  ability  to  answer  questions.  Lichtenstein,  Fischoff 
and  Phillips  (1982)  piovide  a  nice  (but  early)  survey.  Overconfidence  is  frequently  mentioned  in  finance  to 
motivate  the  existence  of  agency  problems  and  to  justify  the  actions  of  noise  traders  as  in  De  Long  et  al 
(1990).  Odean  (1998)  is  a  notable  recent  contribution  with  a  detailed  summary  of  psychology  and  finance 
papers  on  overconfidence.  Rabin  (1998)  discusses  a  number  of  other  psychological  biases  and  their  relevance 
to  economics. 


literature  on  fashion  cycles,  e.g.  Kami  and  Schmeidlcr  (1990)  and  Pesendorfcr  (1995),  most 
of  the  existing  literature  on  social  norms  doesn't  focus  on  change.  I  will  not  try  to  draw 
conclusions  about  other  norms  from  my  results.  My  view  is  that  one  would  need  to  analyze 
each  application  separately  and  think  about  how  the  norm  is  learned  to  assess  whether  a 
drift  should  be  expected  and  if  so  in  what  direction. 

On  a  theoretical  level,  the  paper's  innovation  is  to  note  that  one  can  produce  a  model 
that  explains  a  long  gradual  trend  by  making  a  slight  perturbation  to  a  model  with  a 
continuum  of  equilibria.  In  an  early  presentation  of  this  paper  Robert  Barro  asked  a 
penetrating  question:  'so  arc  you  trying  to  tell  us  that  you're  going  to  explain  a  thirty-year 
trend  by  saying  that  we've  been  out  of  equilibrium  the  whole  time?'  My  answer  is  "Yes!"  I 
hope  to  convince  readers  that  models  of  the  type  I  introduce  make  such  arguments  possible. 

2      Some  data  from  various  disciplines 

Table  1  provides  some  statistical  evidence  on  how  the  form  of  an  academic  article  has 
changed.  The  table  lists  the  average  page  length  and  the  average  number  of  references  for 
articles  in  top  journals  in  a  number  of  disciplines.  Economics  papers  arc  roughly  twice  as 
long  as  they  were  twenty  five  years  ago  and  have  about  twice;  as  many  references.  Jn  almost 
all  fields  papers  seem  to  be  longer  now  than  in  1975.  The  increases  are  more  moderate  in 
the  sciences.  Likewise  with  the  exceptions  of  law  and  history,  articles  now  tend  to  have 
more  references.  While  economics  has  experienced  substantial  reference  growth  it  has  a 
long  way  to  go  to  catch  many  social  sciences. 

Table  2  provides  some  evidence  on  how  long  it  takes  to  review  papers  and  on  how 
extensively  they  are  revised  for  publication  in  a  number  of  fields.  In  almost  every  case  it 
takes  longer  to  get  a  paper  accepted  now  than  it  did  in  1975.  While  Econometrica  and 
the  Review  of  Economic  Studies  have  the  most  drawn  out  publication  processes  among 
the  listed  journals,  similar  trends  are  visible  in  computer  science,  psychology,  statistics, 
linguistics  and  finance,  and  I've  been  told  that  many  rounds  of  revisions  are  also  the  norm 
at  the  top  journals  in  marketing,  political  science,  and  a  number  of  other  social  sciences 
A  slowdown  is  also  visible  in  some  sciences,  but  the  time  scale  is  completely  different. 


3      The  static  model 

In  this  section  I  describe  a.  simple  static  model  of  academic  publishing.  The  main  actors  in 
the  model  are  a  continuum  of  economists  (of  unit  mass).  Each  economist  is  endowed  with 
one  unit  of  time  and  may  write  one  paper.  There  is  one  economics  journal  that  publishes  a 
mass  r  of  papers  with  0  <  r  <  1.  Economists  preferences  arc  lexicographic  in  publications 
and  leisure  time,  i.e.  they  attempt  to  maximize  the  probability  of  publishing  an  article  in 
the  journal;  holding  the  probability  of  publication  fixed  they  prefer  more  leisure  to  less. 

Papers  can  be  fully  described  by  two  dimensions  of  quality,  q  and  r.  The  g-quality 
dimension  is  intended  to  reflect  the  inherent  importance,  interest,  clarity  etc.  of  the  main 
ideas  of  the  paper.  The  r-quality  dimension  is  intended  to  reflect  additional  aspects  of 
quality  that  authors  improve  when  they  arc  asked  in  revisions  to  improve  and  tighten  a 
paper's  exposition,  to  make  clear  relationships  to  other  papers  in  the  literature,  to  generalize 
theoretical  results,  to  check  the  robustness  of  empirical  results,  to  extend  the  analysis  to 
consider  tela  ted  questions,  etc. 

Social  norms  for  evaluating  papers  are  assumed  to  be  common  and  commonly  known. 
Under  the  (a,  z)-social  norm.,  papers  arc  regarded  as  worth  of  publication  if  and  only  if 
aq  +  (I  —  a)r  >  z.  The  parameter  a  may  reflect  two  different  value  judgements.  In 
addition  to  reflecting  what  people  think  makes  a  paper  valuable,  it  can  also  reflect  what 
people  think  authors  should  be  required  to  do.  For  example,  a  referee  might  argue  that 
while  he  feels  that  a  particular  high  q,  low  r  paper  is  "better"  than  the  marginal  paper 
in  a  journal,  it  should  still  be  rejected  because  the  good  idea  does  not  excuse  the  author's 
failure  to  make  r  improvements  required  of  everyone  else. 

The  timeline  of  the  model  is  illustrated  below.  While  the  model  is  described  as  a  four 
step  process  with  three  groups  of  players,  a.t  the  moment  the  authors  are  the  only  ones 
acting  in  a  nonmechanical  way. 


t=  1 


Economists 

choose 

tq  e  [0,  l] 


Referees 
assess  q 

report  r(q) 


t  =  3 


Economists 
choose 

tr  e  [o,  i  -  g 


Editors 
say 

yes/ no 


In  the  first  stage  of  the  model,  authors  choose  the  fraction  tq  G  [0,1]  of  their  time  to 
devote  to  thinking  up  and  developing  the  main  ideas  of  the  paper.  The  result  is  a  paper 
of  q-quality  q  ~  F(q\tq).  Assume  that  F  is  continuously  differentiablc  in  tq  and  for  each 
tq  F  has  an  everywhere  positive  density  f{q\tq)  on  the  interval  [0, rn(tq)]  (with  m{tq)  >  0 
being  possibly  infinite).  Natural  specifications  will  have  the  q  distribution  increasing  in  tq. 
For  example,  q  might  be  assumed  to  be  uniformly  distributed  on  [0,  tq]  or  exponentially 
distributed  with  mean  tq. 

In  the  second  stage  of  the  model  authors  submit  their  papers  to  the  journal.  The 
journal's  referees  correctly  assess  the  quality  q  of  the  paper  and  report  that  it  will  be 
acceptable  for  publication  if  and  only  if  authors  are  able  to  revise  it  and  achieve  an  r- 
quality  of  at  least  r(q)  as  defined  by  aq  +  (1  —  a)r(q)  =  z.  In  practice  one  can  think  of 
r(q)  as  a  measure  of  the  number  of  improvements  rcfereees  ask  for  in  their  reports  and  the 
difficulty  of  the  tasks. 

In  the  third  stage  authors  choose  the  amount  of  time  tr  (E  [0, 1  —  tq\  to  spend  on 
revisions.  The  production  of  /--quality  is  again  a  random  process.  Specifically,  assume  that 
r  =  h(t.r)  +  rj,  where  77  is  a  random  variable  uniformly  distributed  on  [0,  a]  with  a  >  0. 
Assume  that  the  production  of  revisions  is  a  decreasing  returns  activity  with  h(0)  =  0, 
h!  >  0  and  h"  <  0.  To  ensure  that  time  will  be  allocated  to  both  dimensions  of  quality  I 
assume  also  that  h'(0)  =  00,  and  h'(l)  =  0. 

In  the  fourth  stage,  editors  accept  the  fraction  r  of  papers  for  which  aq  +  (1  —  a)r  is 
highest  for  publication. 

An  equilibrium  of  the  model  is  a  quadruple,  (a,z,t*,t*(q))  such  that  r*  and  t*(q)  are 
chosen  to  maximize  the  probability  that  aq  +  (1  -  a)r  >  z  (and  are  as  small  as  possible  if 
there  are  multiple  choices  that  yield  the  same  probability  of  publication)  and  such  that  the 
fraction  of  papers  with  aq  +  (1  -  a)r  >  z  is  exactly  r.  I  will  refer  to  (a,  z)  as  a  consistent 
social  norm,  if  there  exist  choices  of  tq  and  tr(q)  for  which  (a,  z,  tq,  tr(q))  is  an  equilibrium. 

4      Analysis  of  the  static  model 

In  this  section  I  analyze  the  model  described  in  the  previous  section.  I  wish  to  highlight  two 
main  observations.    First,  many  different  social  norms  arc  possible.   Second,  the  marginal 


accepted  papers  tend  to  have  relatively  low  ^-quality  and  relatively  high  r-quality  in  the 
universe  of  published  papers. 

4.1      Characterization  of  equilibrium 

The  analysis  of  the  equilibrium  is  a  straightforward  backward  induction  argument.  Consider 
an  equilibrium  (a,  z,  t*  t*{q))-  Because  of  the  lexicographic  preference  for  publications  over 
leisure,  at  t  =  3  authors  will  devote  all  of  their  remaining  time  to  improving  their  paper's 
r-quality  unless  the  paper  is  sure  to  be  rejected  anyway  or  is  sure  to  be  accepted  even  if 
less  time  is  devoted  to  revisions.  As  a  result  we  have, 

Proposition  1  In  any  equilibrium  of  the  model,  let  q  =  (z  —  (1  —  a)(h(l  —  t*)  +  o~))/a  and 
q={z-{\-  a)h{\  -  t*))/a.    Then,  q<q  and 

(0  if  q<q, 

[   h-l((z-aq)/(l-a))     if  q  <  q. 

(For  this  proposition,  define  h~  (x)  =  0  if  x  <  0.) 

Note  that  q  may  be  less  than  zero  and  q  may  be  greater  than  the  upper  bound  of 
the  support  of  q.  Hence,  it  is  possible  that  either  of  the  extreme  cases  may  not  arise  for 
particular  parameterizations  of  the  model. 

In  the  first  stage,  the  time  tq  allocated  to  trying  to  develop  the  main  ideas  for  a  paper 
will  be  chosen  to  maximize  the  probability  of  eventual  publication.  Write  G{z\t)  for  the 
probability  that  aq  +  (1  —  a)r  is  at  most  z  when  t,q  =  t  and  rr  is  chosen  optimally  as  in  the 
above  proposition.  Note  that  this  probability  is  the  same  as  it  would  be  if  tr  was  simply 
set  equal  to  1  —  tq.  Hence, 

G(z;t)=J_  maxfl,-  a L)f(q\t)dq 

Note  that  G  is  uniformly  continuous  in  z  and  t.  It  is  strictly  increasing  in  z  whenever 
G(z:t)  is  strictly  between  zero  and  one.  Write  G~i{p;t)  foi  the  inverse  of  this  function  for 
a  fixed  t.  The  equilibrium  time  allocation  is  easily  described  by 


Proposition  2  In  the  first  stage  of  any  equilibrium.,  the  time  allocated  to  developing  q- 
quality  satisfies 

t*  G  Argm.axtG~\\  -  r\t). 

Proof 

In  equilibrium,  each  economist's  paper  is  accepted  with  probability  r.  Hence,  z  must 
satisfy  G(z;t*)  =  (1  -  r).  If  r*  does  not  belong  to  Argmax£G-1(l  -  r;t),  then  any  t  which 
does  maximize  that  expression  has  G_1(l  —  r;t)  >  G_1(l  -  r;  t.*)  =  z.  Because  G  is  strictly 
increasing  in  z  whenever  0  <  G(z;  t)  <  1.  this  implies  that  G(z,  t)  <  G(G_1(1  —  t;  t))  =  r, 
which  contradicts  the  optimality  of  t*. 

QED. 

4.2      Some  examples 

Figure  1  illustrates  the  distribution  of  paper  qualities  in  a  "typical"  equilibrium  —  in  this 
case  generated  by  assuming  that  t  =  03,  q  is  uniformly  distributed  on  [0,  r9],  the  technology 
for  r  production  is  h(tr)  =  y/tT  —  tr/2  with  a  =  0.2,  and  the  social  norm  forjudging  papers 
has  a  =  0.5  and  z  as  0.504. 

For  those  parameters,  the  equilibrium  effort  allocated  to  q  production  turns  out  to  be 
t*  ~  0.S2G.  All  three  possible  outcomes  of  a  submission  occur.  Authors  of  papers  with 
g-quality  less  that  q  ~  0.48  realize  that  their  papers  have  no  chance  of  becoming  acceptable 
and  do  not  attempt  to  revise  them.  Authors  of  papers  with  q  €  (q,q)  ~  (0.48,0.68)  devote 
as  much  time  as  possible  to  revising  (setting  tr  =  1  —  t*  %  0.174)  and  have  their  papers 
accepted  with  probability  strictly  between  zero  and  one.  Authors  of  papers  with  quality 
q  G  [<?,  m(tq)  —  t*]  x  [0.68,0.83]  do  the  minimal  revision  necessary  to  ensure  that  their 
papers  will  be  accepted  with  probability  one. 

The  figure  shows  the  outline  of  the  support  of  the  equilibrium  distribution  of  paper 
qualities  in  (q, r)-space.  Paper  qualities  are  distributed  with  a  constant  density  within 
these  regions.  Papers  in  the  lower  left  box  are  those  for  which  authors  set  tr  =  0.  These 
papers  are  never  accepted.  The  upper  right  region  is  divided  into  a  triangle  of  papers  thai 
are  revised  then  rejected  and  a  trapezoid  of  papers  that  are  accepted.  The  mass  of  papers 
in  this  upper  region  is,  of  course,  r. 
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Figure  1:  The  equilibrium  distribution  of  paper  qualities  in  a  "typical"  equilbrium:  r  =  0.3, 
q~U[0,tq\:  h(t)  =  y/i-t/2,  a  =  0.2,  a  =  0.5,  z  as  0.504. 

The  form  of  the  equilibrium  seems  to  reflect  fairly  well  the  functioning  of  an  economics 
journal  One  observation  I'd  like  to  make'  is  that  the  "marginal'7  rejected  papers  have 
relatively  low  ^-quality  compared  to  the  pool  of  accepted  papers  — -  they  all  have  q  £  [q,q\. 
The  marginal  rejected  papers  are  not  relatively  low  in  /--quality.  Their  authors  have  spent 
as  much  time  as  possible  revising  and  achieved  r-qualitics  that  are  on  a.verage  superior  to 
those  of  accepted  papers. 

While  I  think  that  the  case  illustrated  above  is  the  primary  one  of  interest  and  subse- 
quent arguments  focus  on  it,  the  equilibrium  can  take  other  forms  for  different  parameter 
values.  Most  notably,  when  a  is  sufficiently  small  (i.e.  g-quality  is  of  little  importance) 
all  authors  set  tT  =  t*  —  there  will  always  be  some  chance  that  any  idea,  no  matter  how 
vacuous,  can  be  developed  into  a  publishable  paper  and  no  paper's  idea  is  good  enough  to 
make  its  eventual  acceptance  a  sure  thing.  The  following  proposition  formalizes  this  obser- 
vation. Figure  2  graphs  the  equilibrium  distribution  of  paper  qualities  and  the  acceptance 
and  rejection  regions  for  such  a  case:  q  ~  U[0,  tq],  h(tr)  =  v^T  —  tr/2,  o  =  0.2,  a  =  0.2  and 
z«  0.532. 

Proposition  3  Suppose  that  the  upper  bound  of  the  q- distribution,  m(tq).  is  finite  and 
uniformly  bounded  for  all  tq  E  [0, 1].  Then,  there  exists  a  >  0  such  that  for  all  a  €  (0,  a), 
t*{q)  =  1  —  t*  for  all  q  6  [0,  rn.(t*)],  i.e.   all  papers  are  reuised  to  the  greatest  extent  possible 
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Figure  2:    An  example  of  the  equilibrium  quality  distribution  in  a  "low  a"  equilibrium: 
T  =  0.3,  q  ~  U[0,  tq),  h(t)  =y/i-  t/2,  a  =  0.2,  a  =  0.2,  z  tv  0.532. 

and  no  paper  achieves  a  level  of  q- quality  sufficient  to  ensure  that  it  will  be  accepted  with 
probability  one. 


Proof 

To  see  that  q  <  0  note  that  if  a  paper  with  q  =  0  is  revised  to  the  greatest  extent  possible 
and  gets  the  best  possible  draw  on  r-quality  its  overall  quality  will  be  (1  —  a)h(l  —  t*)  +  a. 
In  equlibrium, 

Probjary  +  (1  -  a)r  >  (1  -  a){h(l  -tl)  +  a)}     <     Prob{r  >  Ml  -  tl)  +  a -m(t*)} 

7  ¥  1  —  a       H 


< 


a 


•m{t*)f<j  —>  0  as  q  — »  0. 


1  -a    '"  q' 

For  a  sufficiently  small  the  expression  is  less  than  r,  and  hence  there  is  a  positive  probability 
of  the  paper  with  q  =  0  being  acceptable. 

Similarly,  to  see  that  q  >  m(t*)  note  that  if  a  paper  with  g-quality  m(t*)  is  revised  to 
the  greatest  extent  possible  but  gets  the  worst  possible  draw  on  r-quaHty  its  overall  quality 
level  is  am{t*)  +  (1  -  a)h(l  -  t*)  and 


Pvob{a<7+  (1  -  a)r  >  am(i!)  +  (1  -  a)Wl  -  t*)}     >     Prob{r  >  h(l  -  t*)  + 


H         1  —  a        H 


>     1 


<xm(t*q) 
cr(l-a) 


1  as  a  -»  0. 
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Hence,  for  any  r  <  1  the  probability  that  a  paper  with  g-quality  m(t*)  fails  to  be  among 
the  best  r  is  strictly  positive  if  a  is  sufficiently  small. 

QED. 

Some  of  the  results  I'll  give  later  will  depend  on  the  form  of  the  equilibrium.  To  simplify 
the  statements  of  these  results  I  will  give  the  forms  pictured  in  the  figures  (and  a  couple  of 
other  forms)  names. 

Definition  1  /  will  say  that  an  equilibrium  is  "typical"  or  has  the  typical  form  if  0  <  q  < 
q  <  m(t*}  and  r(m.(t.*))  >  0.  An  equilibrium,  has  the  "low  a"  form,  if  q  <  0  <  m(t*)  <  q. 
It  has  the  "som.ewhat  low  a"  form,  if  0  <  q  <  m,(t*)  <  q.  It  is  of  the  "high  a"  form  if 
0<q<q<  m(t*)  and  r(m(t*))  <  0. 

The  somewhat  low  a  form  is  similar  to  the  low  a  form,  with  the  only  difference  being 
that  in  the  former  authors  of  the  lowest  q  papers  do  not  revise  their  papers.  The  high,  a  form 
is  similar  to  the  typical  form,  but  the  highest  q  papers  are  so  good  that  r(q)  is  negative,  i.e. 
referees  tell  die  authors  that  even  if  they  revised  the  paper  to  make  it  worse  it  would  still 
be  publishablc.  The  authors  of  these  papers  obviously  exert  no  effort  on  revisions  and  have 
their  papers  accepted  with  quality  to  .spare.  In  the  model  with  q  ~  U[0,  tq\,  h(f)  =  \/i—  r/2, 
a  —  0.2  and  r  =  0.3  the  equilibrium  has  the  low  a  form  for  a  €  (0,0.2285),  the  somewhat 
low  a  form  for  a  G  (0.2286,0.3363),  the  typical  form  for  a  €  (0.3364,0.5869)  and  the 
high  a  form  for  a  £  (0.5870, 1).  In  other  specifications  for  the  model  the  equilibrium  can 
take  on  other  forms.  For  example,  if  the  distribution  of  q  is  unbounded  (or  r  is  large)  we 
will  simultaneously  see  papers  of  the  lowest  g-quality  being  resubmitted  and  papers  of  the 
highest  g-quality  being  accepted  with  no  revisions. 

4.3      The  multiplicity  of  consistent  social  norms 

In  the  model  described  above  not  all  social  norms  arc  consistent.  If  there  is  only  room 
in  the  journal  for  a  small  fraction  of  papers,  then  the  quality  threshold  z  must  be  high. 
Thi.v  however,  is  really  the  only  constraint.  .Nothing  in  the  model  restricts  the  weight  the 
community  places  on  7-quality  versus  r-quality.  There  are  a  continuum  of  consistent  social 
norms  with  any  a  being  possible.  The  following  proposition  gives  a  formal  statement  to  this 
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effect,  and  Figure  3  graphs  the  set  of  consistent  social  norms  for  the  model  with  q  ~  U[0,  tq], 
h{t)  =  s/i.  -  t/2,  a  =  0.2  and  r  =  0.3. 

Proposition  4  In  the  m.odel  described  above,  for  any  a  6  [0,  1]  there  exists  an  unique  z* (a) 
such  that  (a,z*(a))  is  a  consistent  social  norm,. 

Proof 

For  any  fixed  a,  let  G{z;t)  be  the  CDF  of  aq+(l-a)r  as  above.  Let  H(z)  =  inft  G(z;t). 
Because  G  is  uniformly  continuous,  t  is  chosen  from  a  compact  set  and  lim2_4oo  G(z;  t)  —  1 
for  all  t,  II  is  continuous  with  H(0)  =  0  and  limz_oo^(z)  =  1.  Hence  there  is  a  solution 
z*(a)  to  H(z)  =  1  —  t.  (a,  2*(q))  is  a  consistent  social  norm. 

It  is  not  possible  for  both  (a,  z)  and  (a,  z')  to  be  consistent  social  norms  with  z  <  z'. 
In  that  case,  an  agent  setting  t.q  equal  to  the  the  equilibrium  choice  under  the  (a,z')  norm 
would  surpass  the  z  threshhold  with  piobability  greater  than  r. 


QED. 
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Figure  3:   Consistent  social  norms  (a,  z) 
r  =  0.3.  and  a  =  0.2. 


in  the  model  with  q  ~  U[0,tq],  h(t)  =  y/t  -  t/2, 


4.4      Explanations  for  observed  increases  in  /-quality 

My  view  of  the  trends  in  economics  papers  mentioned  in  the  introduction  is  that  they  can 
all  be  thought  of  as  aspects  of  an  increase  in  the  r-quality  of  published  papers.  The  model 
makes  clear  that  a  number  of  different  explanations  for  such  a  change  are  possible. 
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The  explanation  I  focus  on  in  the  latter  half  of  this  paper  is  that  the  r-quality  of 
published  papers  will  increase  if  a  decreases,  i.e.  if  social  norms  shift  to  place;  relatively  less 
emphasis  on  q-quality  and  relatively  more  emphasis  on  r-quality.  This  raises  the  r-quality 
of  published  papers  for  two  reasons:  economists  react  by  allocating  more  of  their  time  to 
producing  r-quality;  and  when  choosing  from  the  pool  of  resubmitted  papers  the  journal 
places  more  emphasis  on  r-quality. 

Other  plausible  explanations  do  not  require  a  change  in  social  norms  for  trading  off  q 
and  r.  One  is  that  the  technologies  for  producing  q  and  r  may  have  shifted  over  time.  For 
example,  one  could  argue  that  economists  have  begun  to  exhaust  the  set  of  all  possible  ideas 
and  model  this  as  a  change  in  F{q\tq)  that  reduces  the  marginal  benefit  of  fq.  Similarly, 
one  could  argue  that  advances  in  computer  technology  or  accumulated  knowledge  allow 
authors  to  produce  more  r-quality  per  unit  time.  A  second  potential  explanation  is  that 
publishing  in  the  top  journals  may  have  become  more  competitive.  We  might  think  of  the 
growth  of  the  profession  and  other  factors  as  equivalent  to  a  decrease  in  r.  and  this  might 
raise  quality  standards  in  both  dimensions.6 

The  latter  explanation  is  not  straightforward,  so  I'll  discuss  it  further.  Intuitively, 
reducing  r  has  two  effects.  First,  it  allows  the  journal  to  be  more  selective  when  choosing 
from  the  pool  of  resubmitted  papers.  Second,  it  affects  authors'  time  allocation  decisions. 
If  authors  react  to  increased  competition  by  gambling  on  bold  projects  that  require  a  lot 
of  t.q.  the  second  effect  could  offset  the  first  and  make  the  overall  effect  of  competition  on 
r-quality  ambiguous. 

Formally,  I  should  note  first  that  nothing  in  the  assumptions  so  far  guarantees  that 
C_1(l  —  r;  tq)  (which  plays  the  role  of  the  economists'  objective  function  when  tq  is  selected) 
is  quasi  concave.  If,  for  example,  q  ~  [/[0,  kt?}\  with  n  >  1,  then  the  production  of  g-quality 
would  involve  increasing  returns.  In  some  cases,  both  both  0  and  some  higher  value  of  tq  will 
be  local  optima  and  t*  will  change  discontinuously  in  the  parameters.  To  provide  as  simple 
a  discussion  as  I  can  of  the  potential  effects  of  r  on  r-quality,  I  will  avoid  such  complications 
by  just  assuming  that  F(q:t)  is  such  that  t*  is  the  unique  solution  to  ^(z*(a.r);  t)  =  0 


Ellison  (2000)  notes  that  by  many  measures  the  growth  in  the  profession  over  the  last  30  years  has  been 
moderate,  but  that  decreases  in  the  number  of  articles  published  by  some  top  journals  and  an  increase  in  the 
relative  status  ol  the  top  journals  may  have  also  contributed  to  a  more  substantial  increase  in  competition. 
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among,  values  of  t  for  which  G{z*{a,r)\f)   <   1  and  that  the  probably  of  achieving  the 
standard  is  strictly  concave  at  the  optimum,  i.e.   ^gjT(z"(a,T):t*)  >  0. 


Recall  that  t*{r)  solves 


dG 
dt 


~(z*(r);t*q(r))=0. 


Differentiating  with  respect  to  r  gives 

z*  is  always  decreasing  in  r  and  we  have  assumed  that  ^p~  is  positive.  Hence,  t*  will  be 
decreasing  in  r  if  §^§i{z" '{r);t*(T))  <  0.  To  think  about  the  sign  of  this  derviative  note 
that  |J(z;  tq)  is  the  density  of  the  overall  quality  distribution.  If  the  q  distribution  becomes 
more  spread  out  when  authors  spend  more  time  thinking  of  new  ideas  and  the  production 
of  r-quality  is  a  more  deterministic:  process  in  which  increases  in  tT  only  shift  the  mean 
r-quality  [as  we've  assumed),  then  the  natural  outcome  is  for  the  density  to  be  decreasing 
in  tg,  in  which  case  the  Jj^f  is  negative.  In  this  case,  t*  will  increase  as  r  decreases,  and 
the  overall  effect  of  r  on  r  quality  may  be  ambiguous. 

For  a.  more  direct  intuition  note  that  t*  reflects  authors  setting  a  times  the  effect  of  tq  cm 
q-quality  equal  to  (1  -  a)  times  the  effect  of  rr  on  r-quality  conditional  on  their  luck  being 
such  that  their  paper  is  marginal  for  the  journal.  When  the  journal  is  highly  nonselective, 
this  is  conditioning  on  getting  a  very  bad  draw  from  the  q  distribution.  When  the  journal 
is  extremely  selective,  the  conditioning  is  on  getting  nearly  as  a  good  a  draw  as  possible  on 
q.  With  a  functional  form  like  F{q\tq)  ~  (7[0,  tq],  the  higher  percentiles  of  the  q  distribution 
are  more  sensitive  to  tq  investcments.  Hence  the  return  to  tq  is  greater  when  one  conditions 
on  the  paper  being  marginal  for  a  more  selective  journal. 

Figure  4  illustrates  the  effects  of  changes  in  r  on  time  allocation  and  quality  in  a  model 
with  q  ~  (7[0,fJ,  h{t)  =  Vt  -  t/2,  a  =  0.2,  and  a  =  0.5  The  left  panel  shows  that  t*  is 
monotonically  decreasing,  i.e.  /*  increases  as  the  journal  becomes  more  competitive.'  The 
right  panel  shows  the  effect  of  r  on  the  mean  q-  and  r-quality  of  the  published  papers.  The 

The  shape  of  the  figure  when  r  is  near  zero  is  sensitive  to  the  distributional  assumptions.  \Vi<h  the 
uniform  specification  the  maximum  possible  return  to  /,  is  bounded  and  t"q  is  bounded  away  from  one  as  T 
approaches  zero.  If  q  had  been  specified  as  exponentially  distributed  with  parameter  tq,  the  effect  of  r,  on 
the  1  —  t1  percentile  of  the  q  distribution  would  grow  without  bound  as  r  approaches  zero  and  the  time 
spent  on  revisions  would  decrease  all  the  way  to  zero  in  the  limit. 
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relationship  beween  r  and  r-qnality  is  nonmonotonic.  At  some  points  the  effect  of  r  on  t* 
dominates,  while  at  others  the  effect  of  journal  selectivity  dominates. 


$ 


1         T 

Time  allocation 


Mean  q 
Mean  r 


1      r 
Mean  quality  of  published  papers 


Figure  4:  Effect  of  journal  selectivity  on  time  allocation  and  observed  paper  quality 

5      A  dynamic  model  of  evolving  norms 

The  static:  model  above  has  a  continuum  of  equilibria  corresponding  to  different  social 
norms.  In  this  section  I  describe  a  dynamic  model  of  the  evolution  of  norms.  The  model 
involves  a  population  of  author-referees  who  are  trying  to  learn  and  apply  the  profession's 
standards.  In  contrast  to  what  one  might  think  when  one  hears  the  term  "learning  model" , 
I  will  not  model  agents  as  arriving  with  different  beliefs  and  examine  whether  there  is 
convergence  to  a  common  belief.  Instead,  the  model  will  be  constructed  so  that  agents  will 
have  common  beliefs  at  every  point  in  time  and  I  will  focus  on  whether  agents'  attempts 
to  learn  the  prevailing  norm  lead  to  a  shift  in  norms.8 

The  model  involves  a  discrete  set  of  time  periods  t  =  0, 1, 2,. . . .  At  the  start  of  period 
t,  all  economists  believe  that  that  the  social  norm  is  (at,zt).  They  then  write  a  paper  and 
try  to  publish  it  as  in  the  static  game  of  Section  2  and  serve  as  a  referee.  The  data  they 
receive  via  referee  reports  and  editorial  decisions  will  suggest  to  them  that  the  social  norm 


This  focus  is  nol  intended  to  suggest  that  whether  a  community  will  reach  a  common  norm  and  the 
effects  of  belief  or  preference  heterogeneity  on  I  he  evolution  of  social  norms  are  not  interesting.  Instead,  as 
in  Ellison  and  Fudenbeig  (2000),  the  common  beliefs  learning  model  is  motivated  solely  by  the  desire  for  a 
tractable  model  that  highlights  an  important  effect. 
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is  in  fact,  (at,  zt).  This  leads  them  to  alter  their  beliefs  aceording  to 


at+i  -at  , 


zt+i  -  zt 

for  some  constant  A:  >  0.  To  complete  the  specification  I  must  describe  what  data  economists 
gather  and  how  they  draw  the  inferences  (d(,  zt)  from  the  data. 

What  data  do  economists  get?  I  assume  that  economists  get  two  types  of  data  points. 
First,  when  an  author  submits  a  paper  of  quality  at  least  q  I  assume  that  the  referee  reports 
he  or  she  receives  give  him  or  her  a  data  point  of  the  form  (q,  r(q))9  These  datapoints 
should  all  lie  on  the  line  atq  +  {1  -  at)r(q)  =  zt.  I  will  allow,  however,  for  the  possibility 
that  economists  arc  subject  to  an  overconfidence  bias  when  judging  the  quality  of  their 
own  work.  In  particular,  I'll  assume  that  they  overestimate  the  r-quality  of  their  initial 
submission  and  this  leads  them  to  believe  that  they  have  been  required  to  achieve  an  r- 
quality  that  is  e  higher  than  what  they  have  actually  been  required  to  achieve.10  In  this 
case,  the  (q,r(q))  datapoints  actually  lie  on  the  line  atq  +  (1  —  at)r(q)  =  zt  +  (1  —  at)e. 

Second,  whenever  an  economist  referees  a  paper  that  is  of  sufficiently  high  quality  to  be 
resubmitted  he  or  she  gets  a  data  point  of  the  form  (g,r,  Accept/Reject).1112  Economists 
expect  to  sec  all  papers  lying  above  the  line  atq  +  (1  —  Q/)r  =  zt  being  accepted  and  all 
papers  lying  below  this  line  being  rejected. 

If  economists  each  saw  a  finite  number  of  data  points,  then  their  analyses  would  lead 
to  a  divergence  in  the  second  period  beliefs  even  if  they  entered  the  model  with  common 
beliefs.  To  keep  the  model  tractable  I  avoid  this  by  invoking  word-of-mouth  communication. 


Note  that  in  a  slight  departure  from  the  static  model  I've  assumed  that  journals  do  not  provide  the 
author  with  a  list  of  revisions  sufficient  to  make  the  paper  publishable  if  the  paper's  q-quality  is  so  low 
as  to  make  it  inconceivable  that  a  revision  will  be  publishable.  I  believe  that  this  is  a  good  description 
of  journal  practices.  I  did  not  try  to  incorporate  such  behavior  by  referees  in  the  static  model,  however, 
because  it  seemed  a  needless  complication  and  because  it  creates  a  possibility  for  another  type  of  equilibrium 
multiplicity  that  I  did  not  feel  was  important:  authors  won't  save  time  for  revisions  if  referees  won't  ask  for 
large  revisions  because  they  don't  think  they're  feasible. 

Given  that  the  acceptance  fronteir  is  downward  sloping  in  q-r  space  this  assumption  is  almost  equivalent 
to  assuming  that  authors  believe  their  papers  to  be  of  slightly  higher  q-quality  than  they  actually  are,  or 
to  assuming  that  have  biased  views  of  both  the  q-  and  r-quality  of  their  work.  I've  chosen  the  formulation 
above  because  it  makes  some  results  a  little  cleaner  (especially  those  about  small  a  behavior.) 

Another  important  data  source  for  real  world  economists  is  reading  journals.  Given  the  word-of-mouth 
assumption  below,  this  would  just  provide  redundant  observations  on  all  of  the  acceptances.  For  this  reason, 
nothing  would  be  changed  if  1  included  this  data  source  in  the  model. 

An  ambiguity  in  the  model  is  what  happens  if  the  assumed  standards  is  so  excessively  high  that  fewer 
than  r  papers  are  resubmitted.  In  the  simulations  in  the  next  section  I  assume  that  in  this  case  editors 
accept  some  papers  that  aie  not  resubmitted  and  that  these  acceptances  are  observed  by  the  referees. 
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I  assume  that  each  economist  talks  to  every  other  economist  in  each  period  and  thereby 
sees  all  of  the  data  points  that  were  generated  in  that  period.  While  this  makes  each 
author's  own  experiences  a  measure  zero  subset  of  his  or  her  datasct,  I  do  not  want  to  lose 
the  possibility  of  inferences  being  affected  by  the  authors'  misperceptions  of  the  quality  of 
their  own  work.  I  thus  assume  that  the  {q,r(q))  observations  economists  receive  by  hearing 
others  taik  about  the  referee  reports  they  received  are  contaminated  by  the  author's  bias 
(and  that  the  listeners  do  not  realize  this). 

What  do  economists  do  with  the  data  they  obtain  each  period?  Fitting  the  (a,  z)  model 
involves  estimating  the  slope  and  intercept  of  a  line  that  fits  the  {q,r(q))  data  and  divides 
the  acceptance  and  rejection  regions.  Typically,  no  line  will  do  both  jobs  perfectly.  I  assume 
that  this  does  not  cause  economists  to  lose  faith  in  their  model  of  the  world  and  that  they 
go  ahead  and  try  to  fit  the  data  as  well  as  possible  with  the  («,  z)  model.13 

Formally,  I  assume  that  economists'  period  t  analyses  take  the  form 

(dt,  zt)  =  argminLfa,  zj/zi,^), 

a,  z 

where  L(a,z)  is  a  loss  function  tha.t  describes  how  poorly  the  the  data  (a  measure  /.i\ 
describing  the  (q,  r(q))  points  and  a  measure  \i2  describing  the  (q,  r.  Accept  /Reject)  points) 
fit  the  hypothesis  that  all  referees  and  the  journal  editor  are  applying  the  (a,  z)  social  norm. 
Specifically,  I  assume  that 

L(a, z; m,  fi2)  =  Li(a,z;m)  +  L2(a,z;fi2), 

where 


Li(a, z; fj,i)  =   /  (r{q)-  —        I    d/zi(g), 


^  2 
a  J 

is  a  standard  mean  squared  deviation  measure  of  the  distance  (in  the  r-dimension)  between 
the  {q,r(q))  data  points  and  the  line  0*7  +  (1  -  a)r  =  z  and 

Lz{a,z:n2)=  (  — - r)d/.i2{q,r)+  ('"--7 M/*2(g,r). 

■iRu A  V  1  -  a  )  Jrvr  \  I -a  J 


Formally,  economists  are  estimating  a  misspecified  model.  A  justification  for  not  worrying  about 
economists  noticing  the  misspecification  is  that  in  a  more  realistic  model  economists  would  only  receive 
a  finite  number  of  datapoints  and  there  would  be  a  random  component  to  each  observation,  so  the  form 
of  the  misspecification  would  not  be  so  appaient.  The  idea  that  economists  struggle  to  reconcile  haid-to- 
reconcile  observations  while  maintaining  biased  self  images  does  not  seem  unrealistic  to  me. 
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where  Rua  is  the  set  of  (17,  r)  values  for  which  papers  were  "unexpectedly  accepted"  despite 
failing  to  meet  the  (ct,z)  standard  and  Rur  is  'he  set  of  '"unexpectedly  rejected"  papers 
that  met  the  (a,  z)  standard  but  were  rejected.  L2  can  be  thought  of  as  the  product  of  the 
fraction  of  accept/reject  decisions  that  are  inconsistent  with  the  (a,  2)  model  and  the  aver- 
age degree  of  error  (in  the  r-dimension)  that  appears  to  be  embodied  in  the  "unexpected" 
decisions. 

Obviously  other  loss  functions  would  be  reasonable.  The  most  natural  would  probably 
be  the  negative  of  the  log  likelihood  of  the  data  under  a  hypothesis  involving  referees  and 
the  editor  trying  to  apply  the  (a,  2)  norm  but  making  idiosyncratic  errors  in  judging  the 
quality  of  each  paper.  Analyzing  such  a  specification  would  require  cxaminining  integrals  of 
CDF's  and  PDF's,  however,  and  I  felt  that  the  specification  above  was  the  best  compromise 
in  terms  of  reflecting  a  similar  goodness-of-fit  notion  and  being  tractable. 

6      Analysis  of  the  dynamic  model  with  no  overconfidence  bias 

(e  =  0) 

In  this  section  I  discuss  the  behavior  of  the  dynamic  model  when  economists  do  not  have 
an  inflated  view  of  the  quality  of  their  own  work.  The  main  observations  arc  that  consis- 
tent social  norms  arc  steady  states  of  the  model  and  that  when  referees  are  too  demanding 
economists  infer  both  that  their  standards  were  too  high  and  that  g-quality  must  be  rela- 
tively less  important  than  they  had  thought. 

6.1      Steady  states 

When  there  is  no  overconfidence  bias  it  is  easy  to  see  that  any  consistent  social  norm  of 
the  static  model  is  a  steady  state  of  the  dynamic  learning  model. 

Proposition  5  Suppose  e  =  0  in  the  dynamic  model  and  (oq,  zq)  is  a  consistent  social 
norm..   Then,  (at,  zt)  =  (ocq,  zq)  for  all  t. 

Proof 

All  points  (q,  r(q))  in  the  data  obtained  from  referees'  reports  lie  exactly  on  the  line 
c*o<7+  (1  -cto)r(q)  =  z0.  The  editor's  decisions  are  also  consistent  with  imposing  the  (0.0,  20) 
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standard,  i.e.  all  rejected  papers  have  ao<7+  (1  —  Q'o)r  ~  zo  <  0  and  all  accepted  papers  have 
a$q  +  (1  —  ao)r  —  zq  >  0.  Hence,  both  L\  and  L2  are  zero  for  (a,  z)  =  (c*o,  zo)-  Because 
the  ^-distribution  is  nonatoinic  on  a  continuous  support,  L\  is  strictly  positive  (and  L2  is 
always  nonnegative)  for  any  other  (a,  z).  The  unique  minimum  of  the  loss  function  is  thus 
(ao,zo)  =  (ao,  zo)- 
QED. 

6.2      Disequilibrium  dynamics 

In  this  section  I  discuss  the  disequlibrium  behavior  of  the  dynamic  model.  The  results 
in  this  section  (and  in  the  remainder  of  the  paper)  will  concern  the  uniform  technology 
for  (/-production,  q  ~  U[0,tq}.  The  whole  of  what  I  want  to  say  in  this  section  can  be 
summarized  concisely  by  saying  that  the  dynamic  evolution  of  economists'  beliefs  about 
the  social  norm  (a,  z)  outside  of  equilibrium  follows  the  pattern  illustrated  in  Figure  5. 
The  figure  was  constructed  by  solving  the  model  numerically  for  various  initial  beliefs 
under  the  assumption  that  q  ~  U[0,1q],  r  =  0.3,  a  —  0,2  and  h(t)  =  yjl  —  t/2.  The  sohd 
line  in  the  figure  is  the  locus  of  consistent  social  norms  (a,  z*(a)).  The  vectors  in  the  figure 
are  proportional  to  the  change  in  beliefs  (a/  +  i  —  o?,  z/+]  —  zj)  that  occurs  for  various  initial 
beliefs.14  To  help  organize  the  dynamics  I  have  also  placed  a  vertical  dashed  line  in  the 
figure  at  a  =  0.3363.  This  is  where  the  equilibrium  shifts  from  the  somewhat  low  a  form 
to  the  typical  form. 

The  locus  of  consistent  social  norms  and  the  dashed  line  divide  (a,z)  space  into  four 
regions.  In  three  of  the  four  regions  the  learing  process  is  mostly  just  a  straightforward 
adjustment  of  the  overall  quality  threshold:  when  referees  try  to  impose  a  standard  that 
would  not  allow  the  editor  to  fill  the  journal  they  infer  from  the  unexpected  acceptances 
that  they  must  reduce  2;  when  referees  are  too  soft  and  the  editor  has  to  turn  down  some 
papers  they  recommend  they  learn  to  choose  a  higher  z.15 


14The  speed  with  which  beliefs  change  in  the  model  is  determined  by  an  arbitrary  scaling  parameter  k. 
The  figure  uses  k  =  0.6.  Note  that  unlike  all  other  graphs  in  this  paper  the  x  and  y  axes  in  this  figure  do 
not  have  thf  same  scale.  The  j/-axis  has  been  magnified  by  a  factor  of  1.5.  These  choices  reflect  an  attempt 
to  maximize  ihe  visibility  of  the  directions  and  minimize  clutter. 

I5The  dynamics  in  the  2  <  z*(q)  and  a  large  region  fit  this  desciiption  only  if  a  is  not  too  large.  I  discuss 
what  happens  in  the  "high  o"  case  at  the  end  of  this  section. 
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Figure  5:  The  disequilibrium  evolution  of  social  norms  when  f  =  0. 

What  is  most  important  to  my  main  argument  is  what  happens  in  the  fourth  region 
-  the  upper  right  part  of  Figure  5.  Suppose  that  the  (at,Zt)  standard  is  unreasonably 
high.  i.e.  z  >  z'(a).  Suppose  also  that  the  distribution  of  resubmitted  papers  has  the 
''typical'''  form.  Economists  will  then  (correctly)  perceive  that  referees  are  asking  them  to 
meet  a  very  high  standard  and  that  some  papers  that  they  thought  were  submarginal  are 
being  accepted.  The  conflict  in  this  data  leads  economists  to  do  two  things:  to  believe  that 
quality  standards  are  less  demanding  than  they  had  thought;  and  to  believe  that  r-quality 
is  relatively  more  important  than  than  they  had  thought. 

To  illustrate  why  economists  make  this  inference,  Figure  6  contains  an  enlarged  view 
of  the  data  economists  get  in  one  such  case.16  The  bold  line  represents  the  (q,r(q))  dat- 
apoints  they  get  from  referee  reports.  The  outlined  area  is  the  support  of  the  (uniform) 
quality  distribution.  All  papers  above  and  to  the  right  of  the  bold  line  are  accepted.  The 
journal  editor  also  accepts  papers  in  the  shaded  region  (to  the  surprise  of  the  referees). 
Lower  quality  papers  are  rejected.  The  lines  below  the  g-axis  and  to  the  left  of  the  r-axis 


The  figure  graphs  the  quality  distribution  and  the  best  fit  (d,  £)  when  initial  beliefs  are  that  a  =  0.5 
Hid  z  =  0.53417  with  q  ~  U[0A.q}t  h(l)  =  y/i -  t/2,  a  =  0.2  and  r  =  0.3. 
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illustrate  the  support  of  the  q  and  r  distributions  among  resubmitted  papers.  I  have  used 
bold  lines  to  show  the  support  of  the  quality  distribution  for  the  unexpectedly  accepted 
papers.  These  papers  are  at  the  low  end  of  the  q  distribution  and  at  the  high  end  of  the  r 
distribution.  Economists'  rationalize  these  acceptances  in  part  by  concluding  that  r  must 
be  more  important  than  they  had  thought.  The  dashed  line  graphs  the  social  norm  (a,  z) 
that  best  fits  the  data.  The  flatter  line  accounts  for  many  unexpected  acceptances  and 
gives  a  good  fit  to  the  high  q  part  of  the  [q,r(q))  data. 
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Figure  6:  Economists'  inferences  when  standards  are  too  high:  the  "typical"  case 

Figure  7  contains  a  similar  diagram  illustrating  economists  inferences  when  referees' 
beliefs  (at,zt)  are  unreasonably  tough  and  at  is  sufficcntly  low  so  that  the  distribution  of 
resubmitted  papers  has  the  "low  a"  form  --  all  papers  are  resubmitted  and  all  authors 
devote  the  same  maximum  effort  to  their  revisions.  Here,  the  "unexpectedly  accepted" 
papers  are  uniformly  distributed  in  the  q  dimension,  and  the  dotted  line  illustrates  that  the 
best  fit  is  obtained  by  slightly  lowering  z  while  leaving  the  slope  of  the  line  unchanged. 

Even  with  the  simple  loss  function  I've  chosen,  getting  analytic  expressions  for  the 
optimal  inference  given  an  inconsistent  (q(,  zt)  is  difficult.  Proposition  6  is  a  characteri- 
zation of  the  dynamics  that  brings  out  the  main  observations  I've  mentioned  above.  The 
proposition  characterizes  the  dynamics  for  initial  beliefs  that  are  close  to  being  consistent, 
i.e.    for  zt  close  to  z*(at).    Parts  (a)  and  (b)  note  that  in  the  low  a  and  somewhat  low 
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Figure  7:  Inferences  when  standards  are  too  high:  the-  "low  a"  case. 

a  cases  economists  do  not  adjust  their  estimate  of  a  (at  least  approximately)  and  adjust 
their  estimate  of  z  toward  z*(at).  Part  (c)  notes  that  when  referees'  beliefs  correspond  to 
a  standard  that  is  too  low  the  dynamics  are  similar  in  the  "typical"  case:  the  dynamics  are 
approximately  vertical  when  the  social  norm  is  approximately  consistent.  Part  (d)  relates 
to  my  main  observation.  It  notes  that  when  zt  is  slightly  larger  than  z*(at)  the  dynamics 
involve  both  a  reduction  in  a  and  a  reduction  in  the  overall  quality  standard  (after  ac- 
counting for  the  change  that  is  induced  mechanically  by  the  change  in  a).  The  shifts  in  the 
two  parameters  arc  comparable  in  magnitude.  The  proof  of  the  proposition  is  contained  in 
Appendix  A. 

Proposition  6  Consider  the  dynamic  model  described  above  with  F(q\tq)  ~  (7[0, tq}.  Let, 
tq(a.z)  be  economists'  optim.al  time  allocation  when,  they  believe  that  the  social  norm  is 
(a,z).  Write  b(z)  sa  a{z  —  z*(a))  as  shorthand  for  lim,^,.(Q)  T_,'f(  x  =  a. 
(a)  Suppose  that  for  a  given  Q(  £  (0, 1)  the  unique  equilibrium  for  the  social  norm  (at,  z*(cxt)) 
has  the  low  a  form,.  Then,  there  exists  a  constant  a  >  0  such  that  for  zt  in  a  neighborhood 
of  z*(q()  the  dynamics  have 

at+i  -  at     =     0 
zt+\  -  zt     =     o.(z*(at)  -  2() 
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(b)  Suppose  that  for  a  given  at  £  (0, 1)  the  unique  equilibrium  for  the  social  norm,  (at,z*(ctt)) 
has  the  somewhat  low  a  form  and  that  —^-  is  strictly  positive  at.  t  =  tq(at,z*(at)).  Then 
there  exists  a  constant  a  >  0  such  that  for  zt  close  to  z*{at)  the  dynamics  have 

Q't  +  i  -  a*     ~     0 
zt+i  ~zt     ~     o.(z*(at)  -  zt) 

For  zt  slightly  larger  than.  z*(at)  the  dynamics  have  at+\  —  aj  =  0. 

(c)  Suppose  that  for  a  given  at  G  (0, 1)  the  unique  equilibrium,  for  the  social  norm  (at,  z*(at)) 
has  the  "typical"  form,  and  zt  <  z*(at).  Then  there  exists  a  constant  a  >  0  such  that  for  zt 
close  to  z*(a{)  the  dynamics  have 

at+i  -  at     «     0 
zt+\  -  zt     Rs     a.(z*(a,)  -  z,), 

(d)  Suppose  that  for  a  given  at  6  (0,  1)  the  unique  equilibrium,  for  the  social  norm,  (at,  z*(at)) 
has  the   ''typical"  form,  and  zt   >   z*(at).    Let  Mq   =  (q(at,zt)  +  tq(at.zt))/2  and  MT  = 

Then,  there  exist  constants  a\  >  0  and  «2  >  0  such  that  for  zt  close  to  z*(at)  the 


l-oc. 

dynamics  have 

q,  +  1  -  at     «     -ai(zt  -  z* (at)) 
zt+i  -  zt     «     a2(z* (at)  -  zt)  +  (at+i  -  at)(Mq  -  Mr) 

The  one  notable  feature  of  the  Figure  5  that  I  have  left  out  of  the  discussion  so  far 
is  what  happens  when  z  <  z*(a)  and  a  is  sufficiently  high  so  that  the  equilibrium  has 
the  high  a  form  It  is  apparent  from  the  figure  that  in  this  case  economists  conclude  that 
q  is  relatively  more  important  than  they  had  thought.  The  argument  is  similar  to  the 
argument  for  why  economists  conclude  that  r  is  more  important  than  they  had  thought 
when  standard  are  too  high  in  a  typical  equilibrium.  The  unexpectedly  rejected  papers 
arc  relatively  low  q  papers  and  hence  a  steeper  line  allows  referees  to  account  for  many  of 
the  unexpected  rejections  while  maintaining  a  good  fit  to  the  r(q)  data  for  high  q  papers. 
I  have  not  discussed  this  case  in  more  detail  because  in  a  couple  of  ways  the  argument 
here  seems  less  plausible.    First,  it  involves  economists  mistakenly  not  achieving  levels  of 
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r-quality  that  they  could  have  achieved.  Second,  the  effect  derives  from  economists  trying 
to  fit  datapoints  with  r(ij)  negative. 

7      The  overconfidence  bias  and  gradual  evolution 

In  this  section  I  show  that  adding  a  slight  overconfidence  bias  produces  a  model  in  which 
social  norms  can  slowly  and  steadily  evolve  over  a  long  period  to  place  ever  more  emphasis 
on  r-quality. 

7.1      e-perturbations  of  dynamics  with  a  continuum  of  steady-states 

Before  discussing  the  model,  it  is  instructive  to  discuss  its  stucturc  in  more  generality.  The 
dynamic  model  is  of  the  form 


at+\  -  at    \  _  ,  /    at  -  at 
~t+i  ~  zt    J  \    zi.  ~  zt 

where  (pi,z)  =  argmmQjZ  L(a, z.\a.t,zt).  A  social  norm  (at,zt)  is  a  steady-state  only  if  it 
is  a  solution  to 

— -{auzf,auzt)     =     0 

aa 

dL<  ^  n 

-T-K^L-,zuat,zt)     —     0 

OZ 

This  is  a  system  of  two  equations  in  two  unknowns.  Ordinarily,  one  would  expect  such  a 
system  to  have  one  solution  (or  zero  or  a  few).  The  fact  that  the  dynamic  model  of  the 
previous  section  has  a  continuum  of  equilibria  indicates  that  it  is  somewhat  special. 

What  happens  if  we  take  a  dynamic  model  with  a  continuum  of  equilibria  and  perturb 
it  slightly?  The  answer  of  course  depends  on  how  the  system  is  pertubed.  To  take  a  simple 
example,  consider  a  dynamic  of  the  form  above  with  the  loss  function 

L(a,  z;  au  zt)  =  {a  -  at)2  +  (z  -  z,)2  +  (z  -  a)2. 

This  model  has  every  point  on  the  line  zt  =  at  as  a  steady  state.  One  thing  that  can 
happen  with  an  c-pertuibation  is  an  f-order  shift  in  the  set  of  steady  states.  For  example, 


While  I  earlier  defined  L  as  depending  on  the  measures  fx\  and  jj.7  describing  the  two  types  of  data,  /<] 
and  H2  are  themselves  functions  of  at  and  zt.  I  use  at  and  zt  as  arguments  of  L  in  this  section  to  clarify 
the  nature  of  the  dynamic. 

9.F, 


if  wc  perturb  the  loss  function  above  to 

L(a,z;at,zt,e)  =  (a  -  a,)2  +  (2  -  zt)2  +  (z  -  (a  +  e))2, 

the  system  has  a  continuum  of  steady  states  given  by  zt  =  at  +  e. 

With  a  generic  perturbation,  however,  the  continuum  of  steady  states  will  disappear. 
For  example,  if  wc  instead  perturb  the  system  to 

L(a,z;at,zt,e)  =  (a  -  at)2  +  (z  -  zt)2  +  (2  -  a)2  +  ez2, 

the  only  remaining  steady-state  is  at  =  zt  =  0.  What  happens  to  the  former  equilibria?  In 
this  example,  the  6  perturbed  dynamics  are 

3  +  2f  3  +  2f 

3  +  zf  3  +  ze 

The  first  terms  on  the  right  hand  sides  of  these  equations  tell  us  that  from  any  initial 
condition  the  dynamics  lead  quickly  to  a  neighborhood  of  the  nearly  stable  locus  zt  =  a>. 
The  system  then  evolves  at  an  e  rate  in  a  neighborhood  of  this  locus  toward  the  steady 
state.  If  the  initial  condition  is  very  from  the  steady  state  a  long  gradual  evolution  would 
be  observed. 

Obviously,  the  most  natural  way  to  account  for  a  long  gradual  trend  in  an  economic 
variable  will  usually  be  to  view  the  trend  as  reflecting  a  continuous  shift  in  the  equilibrium 
of  a  model  due  to  a  trending  exogenous  variable.  The  most  general  idea  the  dynamic  model 
is  intended  to  convey  is  that  the  disequilibrium  dynamics  in  perturbations  of  models  with  a 
continuum  of  equilibria  may  provide  an  alternate  method  for  explaining  some  such  trends. 

7.2      Social  norms  for  academic  publishing 

My  main  observation  about  the  model  with  no  overconfidence  bias  was  that  if  referees  try  to 
hold  authors  to  an  unreasonably  high  standard,  then  social  norms  will  evolve  to  place  more 
weight  on  r-quahty  This  disequilibrium  shift  is  not  an  appealing  candidate  for  explaining 
observed  shifts  in  academic  standards  for  two  reasons.  First,  such  an  explanation  would 
be  incomplete  in  that  we  have  no  reason  to  expect  initial  beliefs  to  be  too  demanding  as 
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opposed  to  too  undemanding.  Second,  and  more  importantly,  the  model  does  not  fit  a  long 
gradual  trend.  Instead,  social  norms  in  such  a  model  move;  very  quickly  to  a  neighborhood 
of  the  equilibrium  set  and  then  converge  slowly  without  much  further  movement  along  the 
equilibrium  set. 

The  dynamics  with  an  e  ovcrconfidence  bias  are  qualitatively  different.  When  beliefs 
are  far  from  the  equilibrium  set,  the  e-pcrturbation  has  little  impact  —  we  again  see  rapid 
convergence  to  a  neighborhood  of  the  set  of  equilibria  of  the  unperturbed  model.  The  over- 
confidence  then  becomes  important.  In  the  typical  case  it  destabilizes  the  former  equilibria. 
If  economists'  initial  beliefs  correspond  to  a  consistent  social  norm,  then  their  mispercep- 
tions  of  the  referee  reports  they  receive  will  lead  them  to  conclude  that  overall  quality 
standards  must  be  higher  than  they  had  thought  (i.e.  z  —  zt  >  0.)  For  this  reason,  referees 
will  perpetually  try  to  hold  authors  to  a  standard  that  is  slightly  higher  than  is  feasible.  The 
e-overconfjdcncc  does  not  affect  the  argument  that  economists  will  conclude  that  r-quality 
is  more  important  than  they  had  thought  whenever  standards  are  higher  than  is  feasible. 
Hence,  when  the  equilibrium  has  the  typical  form  we  will  observe  a  gradual  evolution  of 
social  norms  through  the  near  equilibrium  set  toward  lower  a  values. 

Musi-  we  look  forward  to  a.  world  with  no  (/-quality  whatsoever?  In  my  model  at  least, 
the  answer  is  no.  In  the  neighborhood  of  a  consistent  social  norm  the  effect  of  a  small 
overconfidence  bias  is  to  make  economists  think  that  z  must  be  slightly  higher  than  they 
had  thought.  In  the  low  a  and  somewhat  low  a  cases,  when  the  standard  is  slightly  too  high 
economists  infer  that  z  must  be  slightly  lower  than  they  had  thought.  For  any  such  a,  the 
two  effects  turn  out  to  exactly  offset  for  some  z  slightly  greater  than  z*(a).  The  e-pcrturbed 
model  has  a  continuum  of  steady-states  lying  just  above  the  low  a  and  somewhat  low  a 
portions  of  the  set  of  steady  states  of  the  unperturbed  model.  The  model  thus  predicts  that 
the  evolution  of  social  norms  will  come  to  a  halt  as  soon  as  r  is  sufficiently  important  so 
that  no  paper's  idea  is  good  enough  to  let  its  author  be  sure  that  he  or  she  will  eventually 
be  able  to  get  an  acceptance. 

Figure  8  illustrates  the  dynamics  of  the  system  with  a  small  overconfidence  bias.  The 
figure  was  generated  by  solving  the  model  numerically  for  various  initial  beliefs  under  the 
assumption  that  q  ~  U[0,tq],  r  =  0.3,  a  =  0.2,  h(t)  =  y/t  -  t/2  and  e  =  0.01.   The  thick 


27 


solid  curve  on  the  left  side  of  the  figure  is  the  locus  of  steady  states  (a,  zf(a)).  Ou  the  right 
side  of  the  figure  I've  graphed  eight  curves  illustrating  the  evolution  of  social  norms  from 
eight  initial  conditions.  While  the  curves  appear  to  join  together  well  before  they  reach 
the  left  endpoint,  they  actually  remain  separate  over  the  whole  range  and  in  an  extremely 
magnified  figure  what  now  appears  to  be  one  central  curve  would  be  revealed  to  be  a  set 
of  nearly  parallel  curves.  From  initial  conditions  with  z  large  the  system  initially  evolves 
down  and  to  the  left  toward  the  set  of  nearly  stable  norms  and  then  moves  along  this  set 
until  the  somewhat  low  a  region  is  reached.  From  initial  conditions  with  z  small  there  is 
an  initial  evolution  up  toward  the  nearly  stable  set,  and  then  again  a  movement  to  the  left 
along  the  nearly  stable  set. 

The  arrows  on  the  curves  give  a  feeling  for  the  speed  of  movement.  The  arrows  mark 
the  beliefs  that  prevail  after  one  period  of  evolution  and  then  after  every  ten  additional 
periods.18  The  fact  that  the  arrows  are  initially  far  apart  on  each  curve  reflects  that  the 
initial  movement  toward  the  nearly  stable  set  is  rapid  --  from  most  starting  points  the 
curves  get  sufficiently  close  to  the  nearly  stable  set  to  be  indistinguishable  lo  the  naked 
eye  within  ten  or  twenty  periods.  The  fact  that,  arrows  then  become  closely  and  regularly 
spaced  as  norms  evolve  along  the  nearly  stable  set  reflects  that  this  evolution  is  slow  and 
steady.  Here,  the  evolution  from  a  near  equilibrium  with  a  =  0.8  to  a  near  equilibrium 
with  a  =  0.4  takes  about  two  hundred  periods. 

Proposition  7  provides  a  formal  description  of  some  properties  of  the  system.  Part 
(a)  notes  that  the  model  has  a.  continuum  of  equilibria  covering  roughly  the  low  a  and 
somewhat  low  a  ranges.  Part  (b)  notes  that  when  a  is  such  that  the  equilibrium  has  the 
typical  form  there  is  no  steady  state:  at  (a,z*(a))  the  data  is  better  fit  by  increasing  z; 
when  2  is  slightly  higher  than  z*(a)  the  fit  is  improved  by  reducing  a. 

Proposition  7  Consider  the  dynamic  model  described  above  with  F(q\tg)  ~  U[0, tq\.  Let 
tq(a,z)  be  economists'  optimal  time  allocation  when  they  believe  that,  the  social  norm,  is 
(a,z), 

(a)  Suppose  that,  for  a  given  a  G  (0,  1)  the  unique  equilibrium  for  the  social  norm.  (a,z*(n)) 
with  e  =  0  has  the  low  a  or  somewhat  low  a  form..    Suppose  that  ^r  is  strictly  positive 
1  In  constructing  the  figure  I  took  k  =  0.5 
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Figure  8:  The  disequilibrium  evolution  of  social  norms  when  e  >  0. 

at  t.  —  tq(a,z*(a)).   Then,  for  (  sufficiently  small  there  exists  a  value  of  z,  z((a)  such  that 
(a,2f(a))  is  a  steady  state  of  the  model  with  e-overconfidence 

Further,  there  exists  a  positive  constant  a,  such  that.  zf  (a)  —  z*(a)  ~  ae  when  e  is  small.  In 
the  low  a  case  zc(a)  is  exactly  equal,  to  z*(a)  +  2(1  —  a)ae  when  e  is  sufficiently  small. 

(b)  Suppose  that  for  a  given  a  €  (0, 1)  the  unique  equilibrium  for  the  social  norm 
(o,z*(a))  with  e  —  0  has  the  typical  form.  When  c  >  0  we  have  ^(a,  z*(a);a,  z*(a))  <  0. 
For  e  sufficiently  small  there  exists  a  z  >  z*(a)  for  which  the  distribution  of  paper  qualities 
and  expectedly  and  unexpectedly  accepted  papers  has  the  typical  form  pictured  in  Figure  6 
and  the  loss  function  has  -g^{a,  z;a,  z)  =  0.  For  any  such  z,  f^(«,  z;a,z)  <  0.  Hence,  the 
system,  does  not  have  a  steady  state  where  the  distribution  of  paper  qualities  has  the  typical 
form. 

Let  z((a)  be  the  smallest  value  of  z  such  that  ^(a,z;a,z)  =  0.    Then  there  exist  positive 
constants  a  and  b  such  that  zt(a)  -  z*(a)  ss  ae  and  ^(a.  z( (a);  a,  z( (a))  w  —  be  for  small 


For  those  who  do  not  believe  that  academics  suffer  from  overconfidence  bias  (and  those 
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who  are  wondering  if  observed  changes  in  standards  should  be  taken  as  providing  evidence 
of  such  s  bias),  I  would  like  to  note  that  a  gradual  evolution  toward  lower  a  norms  can 
be  generated  by  many  small  modifications  to  the  basic  dynamic  model.  Essentially,  any 
perturbation  that  makes  referees  perpetually  try  to  hold  authors  to  a  slightly  infeasibly  high 
standard  will  do  the  same  thing.  One  example  would  be  an  assumption  that  referees  try 
to  make  up  more  extensive  lists  of  revisions  than  is  standard  to  impress  editors  with  their 
thoroughness.  Another  would  be  an  assumption  that  referees  arc  competitive  (or  spiteful) 
and  try  to  hold  back  others  in  their  field  by  imposing  standards  that  are  slightly  higher 
than  the  norm  (perhaps  because  making  requests  that  are  even  more  demanding  would 
expose  their  spite). 

In  a  standard  one-dimensional  model  where  the  optimal  level  of  revisions  is  determined 
by  a  cost  of  revisions  or  in  a  one-dimensional  learning  model  where  referees  are  learn  z  by 
observing  referees'  reports  and  editors'  decisions,  an  p-pcrturbation  like  these  just  produces 
a  mode!  in  which  the  original  equilibrium  (or  steady-state)  is  shifted  by  c.  In  the  two- 
dimensional  learning  model,  however,  e  changes  can  greatly  change  the  equilibrium  set  and 
create  steady  long-term  disequilibrium  evolution. 

8      Conclusion 

I  have  proposed  that  it  is  helpful  to  think  about  observed  changes  in  the  economics  pub- 
lishing process  with  a  two-dimensional  q-r  model  of  quality.  The  idea  behind  the  dynamic 
model  is  that  social  norms  for  weighing  different  aspects  of  quality  may  shift  as  economists 
struggle  to  reconcile  the  high  standards  being  applied  to  them  with  the  mediocrity  they 
see  in  journals.  The  model  predicts  a  long  gradual  shift  in  standards.  Academics  react 
by  spending  'less  and  less  time  developing  new  insights  and  more  and  more  time  polishing 
papers. 

Among  the  many  possible  extensions  to  the  model,  I  view  the  incorporation  of  random- 
ness in  the  assessment  of  quality  and  in  economists'  data  samples  as  the  most  intriguing. 
Referees  who  receive  signals  suggesting  that  standards  arc  higher  than  they  arc  may  con- 
clude that  r-quality  is  more  important  than  they  thought,  while  referees  who  receive  signals 
suggesting  that  standards  arc  lower  than  they  are  will  not  draw  the  opposite  conclusion. 
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Even  without  an  ovcrconfidcnce  bias  wc  may  sec  increasing  emphasis  placed  on  r-quality. 
Sobel's  (1908)  work  suggests  that  randomness  in  assessments  due  to  heterogeneity  in  tastes 
may  have  more  subtle  effects. 

What  are  the  welfare  effects  of  changes  in  publication  standards?  There  are  no  answers 
within  my  model,  because  individual  preferences  over  q  and  r  do  not  appear.  If  preferences 
arc  time  invariant,  then  welfare  changes  as  standards  evolve.  Whether  changes  are  welfare 
increasing  or  welfare  decreasing  depends  on  how  the  prevailing  social  norm  differs  from  the 
optimum  given  authors'  and  readers'  preferences. 

The  changes  of  the  last  few  decades  have  had  a  substantial  impact  on  economists.  Many 
young  economists  report  spending  as  much  time  revising  old  papers  as  working  on  new  ones. 
Guiding  larger  and  more  frequent  revisions  is  an  additional  burden  on  referees  and  editors. 
If  the  changes  that  have  occurred  were  accidental  and  many  ether  social  norms  arc  really 
possible,  then  a  discussion  in  the  profession  of  what  standards  we  would  like  to  have  would 
be  valuable. 

While  I  have  emphasized  the  model  of  learning  with  ovcrconfidencc.  other  explanations 
for  the;  observed  changes  are  possible.  Indeed,  one  of  the  main  motivations  for  the  static- 
model  is  to  clarify  how  the  growth  of  the  economies  profession,  changes  in  the  set  of  ideas 
waiting  to  be  discovered  and  other  factors  might  alter  the  publication  process.  Ultimately, 
which  of  the  potential  explanations  are  most  important  is  an  empirical  question.  I  attempt 
to  begin  this  sorting  out  in  Ellison  (2000). 

How  else  might  one  examine  the  evolutionary  model  empirically?  With  ideal  data 
one  could  directly  examine  referees'  demands  and  see  how  they  are  affected  by  referees' 
experiences  as  authors.  A  general  premise  of  the  model  that  may  be  more  easily  examined 
is  that  learning  occurs  within  communities  of  referees.  A  good  empirical  test  would  be 
to  observe  whether  a  random  shock  to  a  particular  field  at  a  particular  journal,  e.g.  the 
appointment  of  a  new  editor  who  likes  revisions,  affects  how  papers  in  the  field  are  treated  at 
other  journals.  Another  feature  that  distinguishes  the  evolutionary  model  from  equilibrium 
models  is  that  transitory  shocks  have  permanent  effects.  For  example,  even  after  a  revision- 
loving  editor  is  replaced,  the  fact  that  referees  learned  a  high  revision  norm  will  ensure  that 
publication  remains  slow  for  years  to  come. 
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In  addition  to  trying  to  change  how  people  think  ahout  academic  publishing,  I  have 
tried  to  make  the  general  point  that  a  long  run  trend  can  be  a  disequilibrium  phenomenon. 
Comparative  statics  of  equilibria  will  remain  the  standard  for  explaining  trends,  but  I  hope 
that  models  like  that  I've  developed  here  will  find  other  applications. 
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Appendix  A 

Pi  oof  of  Proposition  6 
(a)  In  the  low  a  case  for  (z,tq)  in  a  neighborhood  of  (z*(at).tq(at,  z*(at)))  we  have 

a  \      1  -  at 

This  function  has  Jp  =  ~2g(i'-'-o  )  "^  ^'^  ~~  *?)>  w^ich  is  independent  of  z.    It  also  has 

^f  =  —h"[\  —  tq)  >  0,  and  hence  in  a  neighborhood  of  z*(at),  tq(at,zt)  is  independent  of 

2;.  The  continuity  of  the  thresholds  implies  that  the  ordering  q  <  0  <  m.{tq)  <  q  holds  for 
all  zt  in  a  neighborhood  of  z*(at). 

Write  ra{q;at,  zt)  for  the  level  of  r-quality  that  is  actually  necessary  for  an  editor  to 
accept  a  paper  of  ^-quality  q  when  the  population  believes  the  standard  is  (at,  zt).  Because 
tq(at,zt)  is  locally  independent  of  zt  and  all  authors  revise  to  the  greatest  extent  possible. 
ra(q;at,  z*)  =  r{q\at,z*{at))  for  zt  near  z*(at).  The  situation  is  thus  like  that  pictured 
in  Figure  7.  When  zt  <  z*(at)  the  situation  is  similar,  but  involving  a  parallelogram  of 
unexpectedly  rejected  papers  lying  just  above  the  (q,r(q))  line. 

To  describe  economists'  inferences,  I'll  make  a  change  of  variables  and  analyze  the 
minimization  of  the  loss  function  over  m  and  w  where  m  =  ^^  is  the  slope  of  the  line 

and   w  —   Z\a_a'1    is  the  level  of  r  quality  required  of  a  paper  with  the  mean  ^-quality, 
Mq  ee  (t.q(at,zt)  +  max(0,  q{atl  zt )))/2.  I'll  carry  out  the  calculations  for  zt  >  z*(q{).  The 
calculations  in  the  opposite  case  are  identical  but  with  some  signs  reversed. 
With  the  change  of  variables  L\  takes  on  a  very  simple  form.19 

rtq(nt,zt)  5  1 

L-l-(m,  w)     =      /  {m(q  -  Mq)  +  w  -  r(q;  at,  zt))~  — -dq 

Jo  t.g(at,zt) 

1  fti(<*t,zt) 

/  {m{q  -  Mq)  +  w-  (mt(q  -  Mq)  +  wt)Y  dq 

JO 


tq(<Xt,Zt) 
1 

tq(atlzt) 

1 


9  (q  —  -A^o)3  2  *> 

mt)  — h  (m  -  mt)(w  -  wt)(q  -  Mq)    +  (w  -  wt)-q 


tq(at. 


jttq(at,  z,)2{m  -  mr)2  +  (w  -  wt)2 

By  making  a  few  comparisons  it  is  easy  to  sec  that  the  minimum  of  L\  +  Lo  can  not  be 
achieved  for  w  >  wt,  or  for  w  <  ra(Mq,at,zt),  or  with  a  slope  m  for  which  iv  +  m(  —  Mq) 
is  outside  the  interval  [ra(0;  at,  zt),r(0;  at,  zt)\.  (For  example,  to  show  the  second  note  that 
for  any  such  w,  L(m,,w)  >  L{mt,ra{Mq,at,zt))  because  L\  is  smaller  at  (mt,ra(Mq,at,zt)) 
and  L2  has  its  global  minimum  there.)  In  the  range  containing  any  potential  minumum, 
L2  turns  out  to  also  be  a  simple  quadratic. 

rtq(at.zt)     ru--ra(Mq;at,zt)+{m-Tnt)(q-Mq)      1  ] 

L-2{m.iu)     =  /  r—dr— -dq 

lo  Jo  cf      fq(at,zt) 


To  make  the  formulas  more  readable  I'll  often  omit  the  /j]  and  f.12  arguments  of  L\  and  L2.  /Ji  and  /xo 
are  determined  by  q(  and  zt,  so  I'll  also  sometimes  substitute  at  and  2,  (or  mt  and  wt)  as  arguments.  I  will 
also  omit  arguments  of  M9,  f,,  q,  and  q  to  improve  readability. 
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1  rtq(at,zt) 


2atq(at,  zt)  Jo 


fiq(at.,zt) 

/  (w  -  ra{Mq\  au  zt)  +  (m.  -  m,)(q  -  Mq))zdq 

la 


1 

r 

Hence. 


tq(at,zt)2{m  -  rnt)2  +  --(w  -  ra(Mq:at,zt))2. 
Zia  lo 


L(rn,,w\mt,wt)  =  -t q(at,  zt)2 (m  -  mt)2  +  {w  -  ivt)2  +  —(w  -  ra{Mq; at, zt))2 . 

The  minimum  clearly  involves  m  =  mt.  This  gives  the  first  part  of  the  conclusion:  a  =  at. 
The  minimizing  value  for  w  is  w  =    "^        2+1"  V  q'at'Z    -  Recall  that  ra  is  independent  of 
zt  in  a  neighborhood  of  z*(at).  We  thus  have 

...  v  Z*(Qt)-QtMg 

ra  (M9;  ot ,  z£)     = 

1  —  at 

z*(at)-zt       zt  -  atMq       z*(at)  -  zt 

=      ■ 1 =  ■ h  Wt. 

I -at  1  -  at  1  -  at 

Substituting  this  into  the  expression  for  w  gives 

[2o  +  ly(l  -af) 

Using  the  identities  at  =  a  and  zt  =  at_Afq  +  (1  —  0.7)^  allows  us  to  conclude  as  desired 
that 

7-t+\  ~  zt     =     k  {aMg  +  (1  -  &)u>  -  zt)  =  k  (atMq  +  (1  -  at)w  -  atMq  +  (1  -  0.7)10,) 

=     fc(l  -  at)(u>  -  iut)  =  fc- — — -(z*(Qf)  -  zt). 

lo  h  1 

(b)  The  proof  for  the  somewhat  low  a  case  is  very  similar.  One  difference  is  that  the  form 
of  G  is  different.  This  makes  it  possible  that  the  second  derivative  of  G  will  not  be  strictly 
positive  at  tq(at,  z*(at)),  in  which  case  tq(at,  zt)  might  not  be  differentiable  at  zt  =  z*(at). 
This  would  cause  many  complications.  To  avoid  them.  I've  just  assumed  in  the  proposition 
that  -Q^-{at,tq(at,  z*(at)))  >  0.  This  ensures  that  tq(at,z)  is  differentiable  in  z  at  z*(at). 
The  analysis  of  the  zt  >  z*(at)  case  then  proceeds  exactly  as  above  to  show  that 

a  —  at     =     0 

m    =    o — r Ara{hiq\at,z,)  -  wt). 

la  +  1 


w 


Write  Ta(zt)  =  1  —  G(zj;  tq(at,  zt))  for  the  fraction  of  papers  achieving  the  zt  standard  given 
the  inital  beliefs.  By  the  envelope  theorem  we  know  that  in  a  neighborhood  of  z*(at)., 


dz 


rjzt)  —(zi{ar):tq{a,:z*{al))(zt  -  z*(at)) 


tq(at,z  (q())  -q{at,z*(at)) 
(1  -  at)atq{at,z*{at)) 
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The  mass  of  extra  papers  the  editor  accepts  when  he  lowers  the  standard  from  z*{at)  to 
z*(a+)  -  (h  is     I""1,   dz.  Hence,  in  a  neighborhood  of  z*{at)  we  have 


(1  -  at)atq  .    .         . 

ra(Mq;auzt)-Wt     =     ^__L(rB(«)  -  r) 

(1  -Qj) 

zt+l  -  zt  ~a{z*{at)  -  zt), 


This  gives 


for  a  =  jAt  as  desired. 

The  analysis  of  the  z,  <  s*(a«)  case  adds  another  slight  complication  --  the  formula 
for  L2  is  slightly  different  because  there  are  no  papers  in  the  triangle  bounded  by  {q,h{l  - 
tq)  +  a).  (q,ra(q,at,zt)),^d(q+{l-at){.ra{q,auzt)-r{q,atlzt))/auh(l-tq)+a)  that 
can  be  unexpectedly  rejected.  The  loss  function  thus  takes  the  form 


2a  +  1  , .  ,9/  ,2  .   ,,,.        ..\2  ,     1  r.„      „  nr...    ,.AA2 

24cr 

'  7(7-  >  r(<?;  m,  io))(r  -  r(g;  m,  w))dqdr, 


where  7  is  the  triangle  bounded  by  the  three  points  above  and  I  is  the  indicator  function. 
To  show  that  at+i  -  at  «  0  it  suffices  to  show  that  for  any  c>  0  there  exists  a  6  such  that 
\m  -  m,\  <  c(z*{at)  -  zt)  whenever  zt  6  (z*{at)  -  6,  z*(at)).  To  see  this  note  that  for  all  z, 
in  some  interval  below  z*(at)  we  have  for  any  m  <  m.,  -  c{z*{at)  -  zt) 

«  (m,.)     .     ?£!«,  -  ,)  »(m  -  ,„,,  -  -jA_£  /r  /(r  >  rfc  ,,  -))(r  -  rfe;  m.  „))  J 
<     -^-tl(*,-8)24z>t)-*) 

1Z<7 

+     ;    l     '  .     sup    1-^—  I(r>r{q;m,w)){r-r{q;m,w))\, 

<?(tq  -  q)  (q,r)er    °m 

where  ,4(T)  is  the  area  of  the  triangle  T.  The  area  of  the  triangle  is  ^^-{ra(q;at,Zt)  - 
r{q;  at,  zt))2 .  The  effect  of  a  dm  change  in  m  on  r(q;  m,  to)  is  <j  -  71/,,  which  is  largest  when 
q  is  farthest  from  Mq.  This  gives 

l^-Kto)  <  -2-^±{tq  -  q)2c(z*(at)  -  zt)  +  ±f^(ra(q;at,  zt)  -  r(q-at,zt))2. 
dm  12c  ~~  4atcr 

The  (ra(q:  a,,  zt)-r(q;  a,,  zt))2  term  is  a  second  order  effect  in  z*(at)-zt.  Hence,  -^{m.,w)  < 
0  for  all  m  <  m.t-c(z*{at)  -  zt)  when  zt  sufficiently  close  to  z*(at).  Combining  this  with  a 
similar  calculation  of  the  derivative  for  m  >  mt  +  c{z*{at)  -  z,)  allows  us  to  conclude  that 
m  e  {mi  -  c{z*{at)  -  zt),  w,  +  c{z*{at)  -  zt))  as  desired. 

A  similar  calculation  shows  that  the  result  on  zt+\  -  zt  is  also  unaffected  by  the  second 
order  change  in  the  loss  function. 
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(c)  When  the  equilibrium  has  the  typical  form  G(z\fq)  is  given  by 

G(z;tq)  = 


(l-a,)(h(l  -tq)+a/2) 


attq 


d2G, 


An  explirit  calculation  of  the  first  two  derivatives  of  this  function  shows  that  ^&r{z;  tq)  >  0 

in  a  neighborhood  of  {z*{at)-,tq{at,z*{at))).  This  again  implies  that  -g^(at,z*(at))  exists 
and  that  the  distribution  of  paper  qualities  has  the  typical  form  for  zt  in  some  neighborhood 
of  z* (at). 

The  (q,r{q))  line  and  the  set  of  unexpectedly  rejected  papers  have  exactly  the  same 
form  here  as  in  zt  <  z*{at)  case  of  part  (b)  of  the  proposition.  The  result  is  thus  identical 
to  the  result  for  that  case. 

(d)  As  above  tq  is  differentiable  in  z  at  z*(at).  The  distribution  of  paper  qualities  and  out- 
comes has  the  typical  form  pictured  in  Figure  6  for  all  zt  in  some  neighborhood  (z*(at),  z*(at)- 
5). 

Write  wa  for  ra{Mq;at1zt).  I  first  note  that  a  number  of  simple  comparisons  imply 
that  economists  must  infer  that  standards  arc  lower  and'g  is  less  important  than  they  had 
thought.  Specifically,  we  must  have  m  >  mt  and  w  6  {wn,  wt).  To  see  this,  note  first  that 
the  line  given  by  (rh,w)  can  not  be  entirely  above  the  (int,u't)  line  over  the  whole  range 
[q(at,Zt),tq(at,zt)]  of  (/-qualities  of  resubmitted  papers.  From  any  such  estimate  both  L\ 
and  Lo  are  decreased  by  moving  the  line  down  until  there  is  an  intersection.  Next,  note 
that  it  is  also  impossible  for  the  minimum  to  have  w  >  wt  with  the  (m,w)  line  intersecting 
the  (m.t.wt)  line  at  q  6  [<?,f<j].  In  this  case,  L\  and  Li  are  both  reduced  by  rotating  the 
fitted  line  about  the  point  were  it  intersects  the  line  r  =  ra(q;  mt,  wt)  in  the  direction  that 
reduces  w.  The  estimate's  also  can  not  have  w  <  wa.  In  this  case  increasing  u>  to  wa 
and  setting  m  =  m.t  reduces  L\  and  makes  L2  equal  to  its  global  minimum.  This  gives 
w  £  (rn(Mq;at,zt),wt).  It  is  then  easy  to  see  that  m  >  mt-  Otherwise,  slightly  reducing  m, 
would  reduce  L\  and  also  reduce  L2.  (The  latter  is  because  the  gain  from  to  improving  the 
fit  in  (<7,  min(g,  Mq))  is  greater  than  the  loss  from  worsening  the  fit  in  the  (possibly  empty) 
interval  \Mq,q\.)  Finally,  all  of  the  extreme  values  (m  =  mt,  w  —  wt  and  w  =  wa)  can  be 
ruled  out  by  looking  at  derivatives  of  the  loss  function. 

To  obtain  the  proposition's  further  conclusion  that  the  changes  in  m  and  w  are  both 
first  order  in  zt  —  z*(at)  I  examine  the  first  order  conditions  of  the  loss  function.  Even 
after  the  discussion  above  there  remain  a  few  possibilities  for  exactly  where  the  best  fit  line 
may  intersect  the  other  lines  in  the  figure:  it  may  be  above  or  below  ra(q;at,  zt)  at  the  left 
edge  and  may  be  above  or  below  r(tq;at,  zt)  at  the  right  edge.  The  first  order  conditions 
are  slightly  different  in  the  four  cases.  I  will  work  out  the  equations  for  the  simplest  case: 
assuming  that  the  best  fit  line  is  strictly  between  ra(q;at,  zt)  and  r(q;at,zt)  throughout 
the  interval  [q,  t.q\. 

In  this  case,  the  loss  function  is  very  similar  to  the  loss  function  in  the  cases  above. 
The  only  differences  are  that  the  set  of  unexpectedly  accepted  papers  does  not  have  papers 
with  (-/-qualities  above  q  and  is  also  missing  a  second-order  triangle  in  (q,  r)-space  below 
{qir{q\at,  zt)).  As  above,  the  second-order  triangle  can  be  ignored.  The  estimates  have 
the  same  asymptotics  as  those  obtained  by  minimizing  loss  functions  given  by  integrals 
identical  to  those  in  part  (a)  but  with  different  lower  and  upper  bounds.    Specifically  we 
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can  examine  the  minimizer  of  L  =  L\  +  L2  where 


Li(m,  w)  =  —{tq  -  qf{m  -  mt)2  +  [w  -  wt) 


and 


L2(m,w)  =  —  — — =-{m  -  mt)    +  *-- [w-wa-(m-  mt)—— 

24cr    tq  —  q  2a  tq  —  q  \  2 

(Note  that  I've  omitted  the  arguments  (at,zt)  arguments  of  tq,  q  and  q  to  improve  read- 
ability.) Trie  first  order  conditions  for  this  minimization  arc  of  the  form 

C\{m  -  rnf)  -  c2(ib  -  wa)     =     0 
—c2{rh  —  mt)  +  03(11)  —  w')     =     0, 

/  2a(tq-q)w,  +  (q-q)wa     .  . 

where  w  =  — 2a(f-  )+(~-~) —  1S  a  weig"tcd  average  or  wt  and  wa  and  c\,  c2  and  C3  are 
positive  constants:  C]  =  ±(fq  -  q)2  +  -^  ij^J  +  \{q  ~  Q)(fq  ~Jl)\  ^2  =  \j{tq-q)  ;  ancI 
C3  =  2  -l — 7    -  , .  Adding  C2/C3  times  the  second  equation  to  the  first  gives 

c2c3  2a  (fg  -  g) 

C1C3  -  cij,  2a(tg  -  q)  +  (q  -  q) 

We  know  that  the  true  minimum  has  m  >  ?n(  and  that  iu/  >  wa,  so  the  first  order  conditions 
of  this  case  can  only  give  the  true  minimum  if  the  constant  in  this  expression  is  positive. 
The  equations  also  give 

2  _  g-g 

c2      CiC32a{u_q)+{-_ql) 

w  -  wt  =  \ (wt  -  wa), 

C1C3  -  c\ 

From  the  discussion  above  this  can  be  the  true  minumum  only  if  the  leading  constant  is 
negative. 

The  fact  that  wt  —  wa  ~  (zt  —  z*(at))/(l  —  at)  implies  that  m  —  nit  and  w  —  wt  are  first 
order  in  Zi  —  z*(at).  The  longer  expression  for  zt+i  —  zt  follows  from  the  calculation 

z-zt     =     aMq  + (l-a)w  -  (atMq  + (1  -at)wt) 

=     (1  -  at)(w  -  wt)  +  Mq(a  -  at)  -  w(a  -  at) 

=     (1  -  at)(w  -  wt)  +  (Mq  -  Mr){a  -  a,)  +  (MT  -  w)(a  -  at) 

The  last  term  in  this  expression  is  second  order  in  zt  —  z*(c*t). 
QED. 

Proof  of  Proposition  7 
(a)  Recall  from  the  proof  of  Proposition  6  that  t.q(at,  zt)  is  differentiablc  in  z  at  z*(at)  and 
that  the  distribution  of  paper  qualities  will  also  have  the  low  a  or  somewhat  low  a  form 
for  2,  in  a  neighborhood  of  z*(at).    The  same  computation  as  in  the  proof  of  part  (b)  of 
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Proposition  6  shows  that  for  zt  in  some  neighborhood  (z*(at),  z*(at)  +  5)  the  loss  function 
is  of  the  form  L  =  L\  +  L2  with 

1 
12' 


Li{m,w;mt,wt)  ^(tq  -qf{m-  m,)2  +  (w  -  (wt  +  e))2 


1    ,.  ,->,  ,9        1 


L2(m.,w;mt,wt)     =     ^r~(*9  ~  ?)  (m  _  rn'*)    +  ^"C"7  ~~  ra(Mq;atj  zt)) 


24a"  q      -  '""     '    2 


<T 


Differentiating,  (mt,wt)  is  a  steady  state  if  and  only  if  wt  —  ra{Mq;at,  zt)  =  2<tc.  In  the 
previous  proof  we  also  saw  that  wt  —  ra(Mq;at,  zt)  m  j^-(zt  —  2*(»f))-  Hence,  for  e 
sufficiently  small  we  can  find  a  zt  >  z*(at)  that  satisfies  the  equation  for  a  steady  state. 

In  the  somewhat  low  a  case  the  equations  above  give  zi{a)  —  z*{a)  ~  2(1  —  a)ae.  In 
the  low  a  case  we  saw  earlier  that  the  expression  wt  —  ra(Mq;  Qt.,zt)  =  ,_a  (zt  —  z*(at))  is 
exact  when  zt  is  sufficiently  close  to  z*(at),  and  hence  the  expression  for  ze(a)  —  z*(a)  is 
an  equality  as  well. 

(b)  Again  for  a  given  at  the  distribution  of  papers  has  the  typical  form  if  zt  is  sufficiently 
close  to  z*(at).  For  wt  in  a  neighborhood  of  r(Mq;at,zt)  and  (m,  w)  sufficiently  close  to 
(mt,  wt)  with  m  <  vn-t ,  w  >  wt  the  expression  for  the  loss  function  is  again  a  slight  variant 
of  that  in  the  previous  proposition: 


1 
12 


Li(m,w\mt,  wt)  —{tq  -  q)2(m  -  m,)2  +  (w  -  (wt  +  e))' 


i   f q-q \  ,         ,2     !  q ~ q  (  ,  Jq-q 

—  I  w  —  - 


L2{m,  w;  m.t,vit)     ~     — — (m  -  mt)    +  —  - =■     w  -  wa  -  (m  -  rn, 

24cr  \tq  -  qj  2atq  -  q\ 

where  again  I've  written  wa  for  ra(A/g;o/,  zj)  to  save  space.  A  direct  computation  shows 
that  jj^ (mt ,  wt;  m; ,  wt)  =  0  and  g^jfVnt,  W/.;mt>  Wt)  <  0  when  (mf,wt)  corresponds  to  a 
consistent  social  norm.  This  is  the  first  result  mentioned  in  part  (b). 

We  will  find  fj  =  0  with  L  parameterized  by  (a,  z)  if  and  only  if  |^  =  0  when  L  is  pa- 
rameterized by  (m,  w).  A  simple  calculation  of  derivatives  shows  that  §^(?"«,  Wt\  mt,wt)  =  0 
if  and  only  if 

tq  ~  q 

wt  ~  u>a  =  2a— =e  =  0. 

q-q 

As  above  this  has  a  solution  zt  with  zt  —  z*(at)  ~  2(1  —  at)a-S—^€.  This  establishes  the  next 

claim  in  part  (b)  and  ihe  fact  that  the  smallest  solution  has  ze(at)  —  z*(at)  ~  ae  for  some 
a  >  0. 

If  (mt.Wt)  were  a  steady  state  of  the  dynamics  in  which  the  distribution  of  paper 
qualities  had  the  typical  form  it  would  have  to  satisfy  the  first-order  condition  above  and  the 
additional  constraint  that  §^(mt,Wt;m.t;wt)  =  0.  We  know  that  Q^(mt,wt;rnt;wt)  =  0 
for  any  m.t  and  wt.  The  w  FOC  can  only  be  satisfied  for  wt  >  r(Mq;  at,  zt).  For  such  values 
of  wt  and  for  m  <  m.t  sufficiently  close  to  mt  we  have 


1  /    /('■9-£)/2  i(m.-m.t)q  f{t<,-q)/2     r 

L2{m.  Wt\m.t,  Wt)  =  — —     — -      /  /  rdrdq  +   /  / 

aVq  ~  q)    \./max(0,9-A/,)  Vo  ' Mq-q  '0 


Wt—  wn  —  (m.— m,  )q  ^ 

rdrdq 


The  first  term  is  the  surprise  due  to  the  unexpected  acceptance  of  papers  with  ^-qualities 
above  q,  and  the  second  is  the  suprise  due  to  the  unexpected  acceptance  of  papers  with 

;-;,s 


lower  g-qualities.  Evaluating  the  integrals  and  differentiating  gives 

0L2/  (tq  -  q)2  -  l(q  -  Mqf 

——{m.t,w,:,m,t,wt)  = —7- : {wt  -wa). 

dm,  orr{t.q  -  q) 

The  derivative  with  respect  to  a  is  simply  this  expression  divided  by  (1  —  at)2 .  This 
is  negative,  which  is  the  second  conclusion  of  part  (b).  It  is  also  immediate  from  the 
expression  that  Q^(at,  zt;at,  zt)  «  be. 

Looking  at  the  numerator  in  the  equation  for  ^  we  can  also  see  that  the  gain  from 
reducing  a  vanishes  as  we  approach  the  somewhat  low  a  region  and  q  approaches  tq  (or  as 
a  approaches  one  and  q  approaches  q.) 

QED. 
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